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Quantum Monte Carlo (QMC) is an advanced simulation methodology for studies of many- 
body quantum systems. The QMC approaches combine analytical insights with stochastic 
computational techniques for efficient solution of several classes of important many-body 
problems such as the stationary Schrodinger equation. QMC methods of various flavors have 
been applied to a great variety of systems spanning continuous and lattice quantum models, 
molecular and condensed systems, BEC-BCS ultracold condensates, nuclei, etc. In this re- 
view, we focus on the electronic structure QMC, i.e., methods relevant for systems described 
by the electron-ion Hamiltonians. Some of the key QMC achievements include direct treat- 
ment of electron correlation, accuracy in predicting energy differences and favorable scaling 
in the system size. Calculations of atoms, molecules, clusters and solids have demonstrated 
QMC apphcabihty to real systems with hundreds of electrons while providing 90-95% of 
the correlation energy and energy differences typically within a few percent of experiments. 
Advances in accuracy beyond these hmits are hampered by the so-called fixed-node approx- 
imation which is used to circumvent the notorious fermion sign problem. Many-body nodes 
of fermion states and their properties have therefore become one of the important topics for 
further progress in predictive power and efficiency of QMC calculations. Some of our re- 
cent results on the wave function nodes and related nodal domain topologies will be briefly 
reviewed. This includes analysis of few-electron systems and descriptions of exact and ap- 
proximate nodes using transformations and projections of the highly-dimensional nodal hy- 
persurfaces into the 3D space. Studies of fermion nodes offer new insights into topological 
properties of eigenstates such as explicit demonstrations that generic fermionic ground states 
exhibit the minimal number of two nodal domains. Recently proposed trial wave functions 
based on pfafflans with pairing orbitals are presented and their nodal properties are tested 
in calculations of first row atoms and molecules. Finally, backflow "dressed" coordinates 
are introduced as another possibihty for capturing correlation effects and for decreasing the 
fixed-node bias. 
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1 Introduction 

Properties and behavior of quantum many-body systems are determined by the laws of quantum 
physics which have been known since the 1930s. The time-dependent and stationary Schrodinger 
equations describe molecules, condensed matter systems, surfaces and interfaces, but also arti- 
ficial structures such as quantum dots, nanowires, ultracold condensates and so forth. However, 
the task of solving the Schrodinger equation for systems of electrons and ions and predicting 
the key quantities of interest such as cohesion and binding energies, optical gaps, chemical reac- 
tion barriers, types of magnetic order or presence of quantum condensates seems to be beyond 
formidable. Indeed, Paul Dirac recognized this already in 1929: "The general theory of quantum 
mechanics is now almost complete. The underlying physical laws necessary for the mathemati- 
cal theory of a large part of physics and the whole chemistry are thus completely known, and the 
difficulty is only that the exact application of these laws leads to equations much too complicated 
to be soluble." Let us therefore try to understand the key obstacles. The first complication comes 
from the fact that a typical system of interest involves large number of particles. Even moderate 
size molecules or periodic supercells of solids contain from tens to thousands or more quantum 
particles. Second, the particles interact and for interesting problems the interactions are strong 
and influence the systems in profound ways. Third, the solutions have to conform to the required 
quantum symmetries such as fermionic antisymmetry and others. This is a fundamental depar- 
ture from classical systems and makes the problem particularly challenging. Fourth, the required 
accuracy for sensible predictions, which are comparable with experiments, is very high. 

Past studies of quantum systems were often based on ideas of reducing the interactions into 
various types of effective mean-fields. These approaches gradually developed into a high level of 
sophistication and despite limitations, they are still very useful and important on their own or as 
starting points for building new theories. At the same time, advent of computer technology has 
offered us a new window of opportunity for studies of quantum (and many other) problems. It 
spawned a "third way" of doing science which is based on simulations, in contrast to analytical 
approaches and experiments. In a broad sense, by simulations we mean computational models of 
reality based on fundamental physical laws. Such models have value when they enable to make 
predictions or to provide new information which is otherwise impossible or too costly to obtain 
otherwise. In this respect, QMC methods represent an iUustration and an example of what is the 
potential of such methodologies. 

Some of the ideas used in QMC methods go back to the times before the invention of elec- 
tronic computers. Already in 1930s Fermi noticed similarities between the imaginary time 
Schrodinger equation and stochastic processes in statistical mechanics. Apparently, scientists 
at the Los Alamos National Laboratory attempted to calculate the hydrogen molecule by a sim- 
ple version of QMC in the early 1950s, around the same time when a seminal work on the first 
Monte Carlo study of classical systems was published by Metropolis and coworkers. In the late 
1950s Kalos initiated QMC simulations for a few-particle systems and laid down the statistical 
and mathematical background of the Green's function Monte Carlo method. Pioneering QMC 
calculations of the correlation energy of homogeneous electron gas by Ceperley and Alder in 
1980 started a new era of stochastic methods applied to electronic structure problems. QMC 
methods were then further developed for quantum lattice models, quantum liquids, nuclear and 
other systems with contributions from many scientists. 

Currently, the term "quantum Monte Carlo" covers several related stochastic methodologies 
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adapted to determine ground or excited states or finite-temperature equilibrium properties for a 
variety of quantum systems. Note that the word "quantum" is important since QMC approaches 
differ in many ways from the Monte Carlo methods for classical systems. In this overview we 
present QMC methods which enable to solve the stationary Schrodinger equation for interact- 
ing Hamiltonians in continuous space. QMC reaches the high accuracy by a direct treatment of 
many-body effects and by employing stochastic computational techniques which enable to cap- 
ture many-body effects well beyond the scope of mean-field or analytical methods. Conceptually 
very straightforward is the variational Monte Carlo (VMC) which involves construction of trial 
(variational) wave functions and evaluations of corresponding expectation values by stochastic 
integration. More advanced approaches are based on projection techniques such as the diffu- 
sion Monte Carlo (DMC) which projects out the ground state from a given trial wave function. 
The method is formally exact, however, for fermions it suffers from a well-known fermion sign 
problem. The reason is that in using statistical methodology we tacitly assume that the sampled 
distributions are nonnegative. Unfortunately, for fermions this not true and in order to circumvent 
this obstacle many practical implementations of DMC employ the so-called fixed-node approxi- 
mation. This introduces the fixed-node error which is one of the focal points of this overview. 

Elimination or, at least, alleviation of the fixed-node errors is therefore of interest from both 
fundamental and practical point of views — since this is essentially the only approximation which 
is not deeply understood and also because its full control requires inordinate computational re- 
sources. At first, the nodal problem looks intractable since many-body wave functions nodes are 
complicated high-dimensional hypersurfaces determined by the full solution of the Schrodinger 
equation. However, study of nodal structures has revealed a number of important properties of 
fermionic states such as nodal domain topologies and their relationships to ideas in geometry and 
spectral theory of operators. Therefore, some of the latest research developments on fermionic 
wave function nodes will be presented and explained. 

Along the same line of the research, we outline current developments in construction of 
efficient and accurate wave functions which explore ideas beyond the traditional determinants of 
single-particle orbitals. Recently proposed Pfaffian wave functions built from pairing orbitals, 
which provide higher accuracy and also correct nodal topologies, will be explained [1,2]. In the 
subsequent part, a fermion coordinate transformation of the backflow type which was recently 
tested for inhomogeneous systems is introduced. We generalize its application to Pfaffian wave 
functions and illustrate the results on some of the first benchmarks of these ideas. 

Fixed-node QMC simulations are computationally more demanding by two to three orders of 
magnitude when compared to the mainstream electronic structure methods which rely on mean- 
fields treatments of electron-electron interactions. However, traditional mean-field theories often 
lack the needed accuracy and also require frequent re-tuning, sometimes just to qualitatively rec- 
oncile the results with experiments. Although at present, building of accurate many-body wave 
functions is far from being automatic and demands very significant effort as well, the results of- 
ten provide high accuracy and become benchmarks for other methods. For cohesive and binding 
energies, band gaps, and other energy differences the agreement with experiments is typically 
within 1-3% [3-5]. The computational cost of QMC increases basically as a third power of the 
system size, making calculations with hundreds of electrons tractable and large clusters [6] and 
solids [7] up to 1000 electrons have been already studied. Producing accurate "numbers" has 
its own merit and the predictive power is particularly valuable for experiments and for practi- 
cal purposes. However, even more important are new insights into quantum phenomena and 
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detailed analysis of theoretical ideas. Indeed, QMC is very much in the line of "it from bit" 
paradigm, alongside, for example, of substantional computational efforts in quantum chromody- 
namics which not only predict hadron masses but, at the same time, contribute to the validation 
of the fundamental theory. Similar simulations efforts exist in other areas of physics as well. 
Just a few decades ago it was almost unthinkable that one would be able to solve Schrodinger 
equation for hundreds of electrons in an explicit, many-body wave function framework. Today, 
such calculations are feasible using available computational resources. At the same time, much 
more remains to be done, of course, to make the methods more insightful, more efficient and 
their application less laborious. We hope this overview will contribute to the growing interest in 
this rapidly developing field of research. 
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The review is organized as follows: 

• The remainder of this Section provides brief introduction into several traditional methods 
of solving the stationary Schrodinger problem for electron-ion systems. 

• The next section follows with the description of the methodology behind the VMC and 
DMC methods. 

• Results of studies on exact and approximate nodes of fermionic wave functions are sum- 
marized in the Sec. 3. 

• Section 4 discusses generalized pairing wave functions and their properties. 

• Overview of preliminary results on the backflow wave functions is given in the section 5. 

• Section 6 shows the example of large scale QMC apphcation: FeO solid. 

• A QMC package Qwalk is introduced in Sec. 7. 

• The last section concludes with comments and ideas on current developments and out- 
looks. 

1.1 Many-Body Schrodinger Equation 

Let us consider a system of quantum particles such as electrons and ions with Coulomb inter- 
actions. Since the masses of nuclei and electrons differ by three orders of magnitude or more, 
let us simplify the problem by the Bom-Oppenheimer approximation which allows to separate 
the electrons from the slowly moving system of ions. The electronic part of non-relativistic 
Born-Oppenheimer Hamiltonian is given by 

where i and j label the electrons and while / sums over the ions with charges Z/. (Throughout 
the review we employ the atomic units, a.u., with = me = e = 1.) The basic properties of 
such systems can be foimd from eigenstates ^„ of the stationary Schrodinger equation 

H^^ = S„*„. (2) 



Colloquially, we call such solutions (exact or approximate) and derived properties as electronic 
structure. 

The history of electronic structure ab initio methods (see, for example, Ref. [8]) started 
soon after the discovery of quantum mechanics by pioneering works of Heitler and London [9], 
Hartree [10], and Hylleraas [11, 12]. An important step forward was made by Fock [13] who 
established the simplest antisymmetric wave function and formulated the Hartree-Fock (HF) the- 
ory which takes into account the Pauli exclusion principle [14]. The theory was further developed 
by Slater [15] and others and it has become a starting point of many sophisticated approaches not 
only in condensed matter physics or quantimi chemistry but also in other areas such as nuclear 
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physics. The HF theory replaces the hard problem of many interacting electrons with a system 
of independent electrons in an effective, self-consistent field (SCF). 

For periodic systems the effective free electron theory and the band theory of Bloch [16] 
were the first critical steps towards understanding real crystals. In 1930s, the foundation of 
the basic classification of solids into metals, semiconductors and insulators was laid down and 
later Wigner and Seitz [17, 18] performed the first quantitative calculation of electronic states 
in sodium metal. Building upon homogeneous electron gas model, the density functional theory 
(DFT) was invented by Hohenberg and Kohn [19] and further developed by Kohn and Sham [20]. 
Subsequent developments proved to be very successful and DFT has become the mainstream 
method for many applications which cover not only condensed systems but also molecules, sur- 
faces, even nuclear and other systems. DFT together with HF and post-HF methods are relevant 
for our discussion of quantum Monte Carlo, which uses the results from these approaches as a 
reference and also for building wave functions. What follows is their brief overview. 



1.2 Hartree-Fock Theory 

Due to the Pauli exclusion principle, any solution of the Schrodinger equation with Hamilto- 
nian (1) has to be antisymmetric imder coordinate exchange of any two electrons as 



(3) 



where i, j denote both spatial and spin degrees of freedom. The Hartree-Fock theory [13, 15] is 
based on the simplest antisymmetric wave function, sometimes called the Slater determinant. 



*(1, 2, . . . , TV) = (l)^i.(2) • . • 'Pi AN) 



(4) 



(^i(l) ^i(2) 

^2(1) ^2(2) 

<^jv(l) <^iv(2) 



<Pn{N) 



= det[^i(l),...,<^jv(A^)], 



to approximate the state of A^-electron system. are one-particle spin-orbitals, each being 

a product of the spatial (pf{j) and spin ai{(Tj) orbitals. (Note that ffiJ) is independent of spin 
<j in closed-shell cases. In open-shell systems, this assumption corresponds to the spin-restricted 
Hartree-Fock approximation). The Hartree-Fock total energy can be found from the following 
variational condition 



= min; {ipj\^j) = 1, j = 1, ...,N. 



(5) 
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If the Hamiltonian is spin-independent or it is diagonal in the spin basis a = {| t); | 4-)}) the 
expectation value of Hamiltonian (1) with the wave function (4) is given by 



E 



HF 



^-(r)^-(r')¥'J(rX(r') 



<(r)dr 

1 J2 /V¥'r*W'P.?*(r')^rWv'.?(r') 

- ^ E// dr dr'. (6) 

The expression for the HF energy contains three different terms. The first one represents one- 
particle contributions, the second is the Coulomb interaction also called the Hartree term and 
the last contribution is the exchange or Fock term. Note that for i = j (self-interaction) the last 
two terms explicitly cancel each other. Variation of the HF total energy leads to one-particle 
HF equations with the resulting spectrum {ip^ , Cj} of occupied and virtual orbitals obtained by 
solving 



-^V2 + yext + ^HF({<}) 



<(r) = ei^1{v) (7) 



The effective field operator Vrf contains both Coulomb and exchange terms and due to the self- 
consistency depends also on the set of occupied orbitals. The resulting operator is both nonlinear 
and nonlocal as manifested by the fact that Vrf, in general, differs for each state i. This compli- 
cated dependence makes the solution of HF equations rather involved. The solutions are usually 
expanded in an appropriate basis sets and iterated to self-consistency. Sophisticated and pow- 
erful tools such as well-developed and extensive quantum chemical or solid state packages are 
necessary for such solutions. For solids the HF equations were out of reach for a long time and 
only in late eighties the first package had emerged which was able to find reasonably accurate 
and converged solutions. 



1.3 Post Hartree-Fock Methods 

The HF energy is variational, i.e, above the exact eigenvalue, and whatever is left out is referred 
to as electronic correlation. For a given state, the correlation energy is therefore defined as a 
difference of the HF energy and the exact eigenvalue, 

Ecorr = Ehf — E. (8) 

In electron-ion systems Ecorr is only a small fraction of the total energy, of the order of ~ 1-3 
%, however, correlations impact almost all quantities in a major way. For example, correla- 
tion accounts for a large portion of cohesions, excitations, barrier heights and other important 
energy differences. Many other effects such as magnetism or superconductivity are driven by 
correlations so that its accurate description becomes crucial. Revealing the nature of correlation 
effects and how they influence the behavior of quantum systems is therefore one of the central 
challenges of modem electronic structure research. 
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The missing correlation in the HF theory can be accounted for by expanding the wave func- 
tion into linear combination of Slater determinants with one-, two-, A^-particle excitations. 
Providing the one-particle set of states is complete and all possible excitations are covered one 
can formally reach the exact solution. Unfortunately, the convergence of such methods is slow 
and the computational demands grow exponentially with the system size. Nevertheless, for small 
systems such methods are effective and provide valuable checks for other approaches. The most 
frequently used post-HF approaches are configuration interaction (CI), multi-configurational 
self-consistent field (MCSCF), and coupled cluster (CC) methods. The idea behind CI is to 
diagonalize the A/'-electron Hamiltonian on the space of orthogonal Slater determinants. These 
determinants are typically constructed as few-particle excitations from the reference determinant 
(usually the HF solution) using the virtual HF orbitals. The CI method thus provides the most 
optimal expansion on such determinantal space. The most commonly used is CI with all the 
single and double excitations (CISD). Besides the slow convergence in one-particle orbitals the 
method becomes problematic with increasing N since the recovered correlation is proportional 
to VN (instead of N), thus leading to the well-known size-consistency problem (i.e., a wrong 
scaling of total energy with N). 

The MCSCF is a generalization of the expansion in excited determinants and differs from 
CI by optimization of both of the orbitals and the determinantal coefficients. Such optimizations 
are quite difficult and the demands often limit the size of the problem which can be studied. The 
special case of the MCSCF is the complete active space SCF (CASSCF), which is size consistent, 
at the price of exponential scaling in the number of particles. 

The size-consistency problem can be overcome by going beyond linear expansions as it is 
done in the coupled cluster method [21]. The wave function ansatz is based on exponentiated 
excitation operators and the corresponding optinnized parameters are found by solving a set of 
coupled nonlinear equations. The computational cost for CCSD (with singles and doubles) scales 
as and therefore constitutes a formidable challenge for larger systems. The commonly used 
CCSD(T) (CC with single, double and perturbative triple excitations) is often used as a standard 
in accuracy for small systems. For an overview and detailed introduction into the above men- 
tioned quantum chemical methods we refer the reader, for example, to the book by A. Szabo and 
N. S. Ostlund [22]. 



Previous methods were examples of the wave function based theories. For the density functional 
theory (DFT) the primary object is the one-particle electron density, i.e., the diagonal part of 
one-particle density matrix. DFT is formally an exact method based on Hohenberg and Kohn 
theorem [19] that states that the ground state properties of a many-electron system can be ob- 
tained by minimizing the total energy functional 



where the F[/9(r)] is some unknown, universal functional of the electronic density p{v). Assum- 
ing such functional would be known, the total energy E would exhibit a minimum for the exact 
ground state density p(r). 



1.4 Density Functional Tlieory 




(9) 
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Kohn and Sham [20] introduced an ansatz for the density in terms of one-electron orbitals of 
an auxiliary non-interacting system as 



N 



p{T) = J2\^i{r)f. (10) 

i 

The total energy of electronic system can be then expressed as 

m^)] =-\Y.j WVVi(r) dr+ j p(r)Kxt(r) Av+\ j j ^^^^ AvAv'+E^M^)]. 

(11) 

where the first two terms represent the familiar energy of a non-interacting system in the ex- 
ternal potential, the third term being the Hartree term while the rest is an unknown universal 
exchange-correlation functional. By minimization of the total energy as a functional of normal- 
ized occupied orbitals one can derive the Kohn-Sham equations 



<f'({v) = ei^1{v). (12) 



The structure of the equations appears similar to the HF equations, the key difference is that 
the effective exchange-correlation operator is nonlinear yet local, since it depends only on the 
density but not on individual states. This feature of the DFT theory simplifies the calculations 
considerably and it has enabled to expand the DFT applicability to very large systems. If the 
functional £'a;c[/5(r)] would be precisely known, we would have one to one mapping between 
difficult many-electron system and an effective one-particle problem. The fundamental difficulty 
is that the exact exchange-correlation functional is unknown and therefore we have to rely on 
approximations. 

The simplest is the local density approximation (LDA), 

£.Jp(r)] = j e^,^«[p(r)]p(r)dr, (13) 

where e^'^^ is the exchange-correlation energy of the homogeneous electron gas of the same 
density. It is interesting that basically in all practical implementations of LDA, the correlation 
portion of this energy functional comes from the pioneering QMC calculations of the homoge- 
neous electron gas [23]. 

It is intuitively suggestive to generalize the exchange-correlation functional to include also 
the density derivatives, i.e., to express ea;c[p(r), Vp(r)] as a function of the local density and its 
gradient. This was the essential idea behind the generalized gradient approximation (GGA) [24— 
27], which provide improved results for some quantities and has enabled wide-spread use both 
in condensed matter physics and quantum chemistry. 

There are many flavors and generalizations of the density functional theory, e.g., hybrid- 
functionals [24, 26, 27], time-dependent DFT (TD-DFT) [28, 29] for calculations of excited 
states, extensions to non-local density fimctionals (averaged density approximation (ADA) [30] 



12 



M. Bajdich and L. Mitas 



and weighted density approximation (WDA) [30]), or orbital-dependent functionals (e.g. self- 
interaction corrections (SIC) [31] and LDAh-U [32]). For some systems the DFT methods are 
very effective and provide reasonable accuracy at very low cost. Unfortunately, there are many 
cases where DFT requires further elaboration: introduction of ad hoc parameters such as in 
LDAh-U approach or perturbative corrections such as in the so-called GW method. Even then 
for many systems such as transition metal compounds, weakly interacting van der Waals systems 
and other cases the DFT has profoimd difficulties in reaching the required accuracy. 
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2 Quantum Monte Carlo Methods 



2.1 Introduction 

Quantum Monte Carlo (QMC) is a suite of approaches based on two complementary strategies 
for solving quantum many-body problems. The first key component is an analysis of many-body 
effects at the wave function level: wave function analytical structure and impact of potential 
singularities, efficient functional forms, handling of symmetries and boundary conditions, ex- 
plicit multi-particle correlations and construction of intermediate effective Hamiltonians which 
enable to build-up more accurate many-body treatment. The second key ingredient is based on 
employing stochastic techniques whenever it is impractical or inefficient to use analytical and 
explicit constructions. This is done on several fronts: evaluations of high-dimensional integrals, 
solving of quantum equations by mapping onto equivalent stochastic processes, development of 
algorithms and numerical approaches for high accuracy and efficiency. 

Over the last three decades, the impact of QMC has been steadily growing as scientists ex- 
plored new algorithms and produced results which were otherwise impossible to obtain by tra- 
ditional methods. Currently, QMC covers a great variety of specialized approaches tuned to 
particular Hamiltonians and applications. In this review we will briefly describe the variational 
and diffusion quantum Monte Carlo methods. 

In the variational Monte Carlo (VMC) the expectation values are evaluated via stochastic 
integration over SA^ -dimensional space using trial (variational) wave functions constructed to 
capture the desired many-body phenomena. The variational theorem ensures that the expectation 
value of the Hamiltonian is, within the statistical error bar, a true upper bound to the exact ground- 
state energy. The results are determined by the choice and quality of the trial wave function 
construction and therefore are subject to a variational bias. The second described method, the 
diffusion Monte Carlo, removes most of the variational bias by projecting out the ground state 
component from any trial function with nonzero overlap within the given symmetry class. The 
fixed-node approximation enables to avoid the fermion sign problem and makes the method 
scalable to large sizes. 



2.1.1 Metropolis Sampling 

Basic Monte Carlo multi-variate integration approaches are based on the following simple idea. 
Assume that we are interested in finding an expectation value of j4(R) for a known non-negative 
distribution P(R) 

f ^(R)P(R)dR 

= /V(R)dR ' '''' 

where R is a position in dimensional space. This is a common expression for evaluation 
of observables encountered in classical or quantum statistical mechanics. If the dimension of 
space is large and the integral cannot be reduced to sums and/or products of low-dimensional 
integrals, its straightforward evaluation scales exponentially in the dimension d. This is easy to 
see by considering a simple rectangular integration rule with Ng grid points in each dimension. 
The total number of evaluations will be (Ng)"^ and therefore not feasible even for moderate 
values of Ng and d. The key advantage of stochastic methods is that they are immune to the 
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dimensionality problem under reasonable conditions which are easy to verify (see below). Let us 
first consider how to generate samples of the distribution V(R.). This is most conveniently done 
by the Metropolis algorithm [33] which generates a Markov chain of samples by a stochastic walk 
in the configuration R-space. The stochastic samples are sometimes colloquially referred to as 
random walkers. The algorithm enables us to sample any desired multi-dimensional probability 
distribution ■P(R) without a prior knowledge of its normalization. Given some transition rule 
P(R R') from R to R', which satisfies ergodicity and detailed balance, 

P(R H!)V{Ii) = P(R' R)P(R'), (15) 
the desired probability density ■P(R) will converge to an equiUbrium state given by 



/ 



P(R-^R')?'(R)dR = P(R'), (16) 



see, for example, [34]. The transition rule P(R — > R') can be further written as a product of the 
sampUng distribution T(R R') (typically Gaussian distribution centered around R) and the 
probability of an acceptance A(R R') of the proposed step 

P(R^R') = r(R-^R')^(R-^R')- (17) 

The conditions of detailed balance are satisfied by choosing the A(R — )• R') to be 

T(R' -> R)-P(R')' 
T(R ^ W)ViIi) 



A(R R') = min 



(18) 



In this manner, the Markov chain initialized at some random state evolves towards an equilibrium. 
After some initial convergence and equilibration period we can start to collect the statistics. 
Assuming that we can apply the central limit theorem, the usual estimators for the mean, variance 
and error of the mean are given by 

1 ^ 

= ^Y.^^^rn)+0{l/^), (19) 

m 

M 



CTA 



M 

where M represents the number of statistically independent sampling points. 

The computational demands depend on the desired error and on the variance a\. Seem- 
ingly, the estimators and the error bar do not depend on the dimensionality d of the problem. In 
general, however, one needs to take into account the impact of increasing d for the following: i) 
amount of calculation needed to obtain statistically independent sample of P(R) and for evalu- 
ation of j4(R); and ii) growth of variance u\. Providing all of these grow as some polynomials 
of d, the method guarantees the overall polynomial scaling of the algorithm. In its basic form the 
expression for computational demands is then 

_ 

^comp — ~2~ '^sample.'! (^0) 

^ A 
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where Tsampic lumps the time needed to obtain statistically independent sample and to evaluate 
required quantities. 

We note that it is often possible to decrease the variance by the importance sampUng methods 
and thus increase the efficiency as explained, for example, in Ref. [34]. 



2.2 Variational Monte Carlo 

Let us consider a trial variational wave function ^t(R-) which approximates the lowest eigen- 
state of Ti, for a given symmetry. We will elaborate on constructions and properties of '^i in 
Sec. 2.4. In variational Monte Carlo (VMC) the expectation values for the given 4't(R) are 
evaluated by stochastic sampling. For example, the total energy is given by 

/vi/^(R)7^vl/T(R)dR /|vl/r(R)pigL(R)dR ^ 
EvMC= j^^(R)^^(R)dR = /|*T(R)PdR ^^^^ 

where we have introduced the so-called local energy 

- ^^^^ 
The VMC estimator Evmc is then given as 

1 ^ 

Evmc = j^Y.^Li^) + 0{l/sfM), 

m 

assuming the set of M configurations is distributed according to P(R) = |\E't(R)P. 



(23) 



2.2.1 Correlated Sampling 



Correlated sampUng exploits the fact that the variance of difference of correlated random vari- 
ables X and Y decreases as their correlation increases. For example, consider variational energy 
energy difference for two close wave functions '^^^ and e.g., for different ionic positions 
or with small external field when evaluating responses. The energy difference can be written as 



El — E2 



e[^\r) dR 



E^^'iH) dR 



/|*«(R)PdR 



/l4^)(R)|2dR 



*W(R)| = 



,(1) 



_/|*^'^(R)|2dR /u;(R)|*^'^(R)|2dR 



w{R)E<f\R) 

(1)^ 



dR, 



(24) 



where the re-weighting factor is given by ^(R) = |^'y^(R)p/|\l/^^'(R)p. Therefore, we can 
obtain the above energy difference as a sum over weighted local energy differences. The esti- 
mations can be carried out using the set of samples drawn, for example, from the distribution 
(^'^•')^. There are other possibilities such as sampling |\l'^^''\l'y ^ | and modifying the expres- 
sions accordingly. By taking the difference, the noise mostly cancels out, since for very similar 
wave functions the random fluctuations will be very similar and the weights will be close to 
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unity. Consequently, the variance of the difference can be much smaller than the variance of 
each individual quantity alone. The actual decrease in variance depends on particular application 
but gains can be two orders of magnitude or more. The algorithm can be further generalized for 
more complicated estimators and has been developed also for the diffusion Monte Carlo method 
presented below [35]. 



2.3 Diffusion Monte Carlo 

2.3.1 Imaginary Time Schrodinger Equation 

Diffusion Monte Carlo method belongs to a class of projection and Green's function QMC ap- 
proaches. Let us consider the imaginary time Schrodinger equation 

-^^'^ = (^-^T)*(R,r), (25) 

where r is a real variable (imaginary time) and Et is an energy offset. The effect of imaginary 
time is an exponentially fast projection of the lowest state of a given symmetry from any trial 
function with non-zero overlap. This can be easily seen by expanding ^'(R, r) in eigenstates 
{$„} of W so that for time t ^ oo one finds 

lim ^(R.r) = lim y($„|*(0)) e-^^^"-^^^ $„(R) (26) 

r^OO T—^OO ' ^ 

n 

= lim ($o|*(0)) e-"(^»-^^)$o(R)- 

r— >oo 

Although the absolute ground states of the Hamiltonians we consider are bosonic, by keeping 
the wave function in the antisymmetric sector, one can find the lowest antisymmetric state. The 
parameter Et can be adjusted to E^ so that the exponential weight of ^ becomes asymptotically 
constant. It is convenient to rewrite the equation (25) into into an integral form using the Green's 
function 

G(R ^ R',t) = (R| e-^(«-^^)|R'). (27) 
Then after propagation by a time interval Ar the wave function is given by 

*(R,t + At)= ^ G(R^R',Ar)*(R',T)dR'. 



(28) 



and the propagation can be continued by iterations. 

Let us first consider the Schrodinger equation with non-interacting particles 

The equation has a familiar form of diffusion in R-dimensional space. Assuming that we have 
N particles in a 3D space the corresponding Green's function is a SiV— dimensional Gaussian 
with variance = r. The equation describes a well-known process of diffusing particles in 
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a random noise medium with the diffusion constant equal to 1/2. The key departure from tra- 
ditional methods of solving such differential equations is in the solution representation. The 
quantum amplitude ^'(R, r) is described by sampling, i.e., as a density of random walkers (delta 
functions in R— space) 

M 

*(R, r) = dens[^ 6{R - R™(r))] + 0{1/Vm), (30) 

m=l 

where by dens we denote the density estimator, which can be carried out by appropriate binning 
of the walker ensemble. The function is therefore represented only statistically so that the values 
are known up to the statistical uncertainty term as indicated above. If we substitute the ensemble 
of delta functions into the integral Eq.(28), we obtain Gaussians of width r centered at the loca- 
tions {Rm}- In order to restore the walker representation back we then sample the Gaussians, 
i.e., each walker evolves to a new position as given by 

R„(r + Ar) =R„(r) + \/A7x, (31) 

where % is a SA'^-dimensional vector of independent random numbers with Gaussian distribution 
and unit variance. The process is iterated until the long-time limit r — > oo is reached. 

Considering now more interesting Hamiltonian with external potential and interaction terms 
represented by V, one can find the following short-time approximation [36] of the Green's func- 
tion using the Trotter-Suzuki formula as 

G(R ^ R', r) = (R| c-"(^+^-^^) |R') (32) 



(27rr)-3^/2exp 



/\2 



(R - R Q 
2t 



exp 



V(R) + V(R') 



-E' 



The approximation is of the order of 0{t^). For sufficiently small time step we can propagate 
the walkers even for Hamiltonians with complicated local interactions. The last term shows 
that the propagation involves two processes: Gaussian diffusion caused by the kinetic energy 
operator, as in the case of free particles, and reweighting, which originates from the presence 
of the potential. Note that the weights will vary very strongly whenever the potential varies. 
This is true, in particular, close to any potential singularities and makes such process very noisy 
and inefficient. In order to make the sampling more efficient we need to smooth out the terms 
entering the weights and to modify the the algorithm. 

2.3.2 DMC Importance Sampling 

Let us introduce importance sampling [36-38] Instead of sampling the wave function \I'(R, r) 
we will sample a mixed distribution /(R, r) = 'J't(R)^(R, t), where ^'t(R) is some trial 
function. Multiplying the Schrodinger equation with '^t and rearranging the terms leads to the 
following equation for the mixed distribution / 

_ c?/(R,t) ^_ lvV(R,r) + V- [vc(R)/(R,t)] (33) 

+ (^i(R)-i;T)/(R,T), 
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where (R) = V In | "fx (R) | represents a drift velocity term and El (R) is the local energy 
given by Eq. (22). The corresponding short-time approximation of the Green's function is given 
by 



G(R ^ R', A.) «(2.A.)-3-/^ exp " ^' " ^^^^^^'^^ 



2At 

. f El{R) + ElCR') ^ 

X exp -At ( ^ — — ^ — - - E 

= Gd(R.^ R', At) X Gb(R ^ R', At) 



(34) 



Here, we can again interpret the action of the short-time Green's function as a diffusion-drift 
process evolved according to Gd and the re- weighting process Gb controlled by the average 
of local energies at R, R' and Et- This transformation has several implications. First, due to 
the drift, the density of walkers is enhanced in the regions where \E't(R) is large. Second, the 
re-weighting term contains the average of local energies, which for ^'y approximating ^fo, is 
close to a constant and therefore much better behaved than possibly unbounded potential 1^(R). 
By a proper choice and construction of 'ifx one can eliminate large amount of statistical noise, 
potential singularities, etc. For any large-scale calculation accurate is important and improves 
the efficiency by two orders of magnitude or more. 

In actual simulations, the walkers are initially distributed according to /(R, 0) = |^'t(R)P 
with the weights of all walkers set to unity. The propagation then proceeds by iterative application 
of the Green's function. First, the position of each walker evolves according to the diffusion- 
drift Gd from Eq. (34). However, accuracy of Go is not uniform and decreases in regions with 
strongly varying drift. In order to correct for this finite time step bias the walker move is accepted 
with the Metropolis probabiUty 

A(R^R0.mi„fl,S5ffi^iMSl(?O!!y ,35) 
V Gi,(R^R')I*t(R)IV 

The time step At is chosen to be small so that the acceptance in the DMC method is high, typi- 
cally 99% or so. The Metropolis step improves the Green's function approximation appreciably 
and decreases the time step bias due to the fact that it enforces the detailed balance condition 
which is fulfilled by the exact Green's ftmction. 

After evolving the walker position we need to update the walker weight according to 

Wm{T + Ar) = Gb (Rm ^ R^ , At) w„ (t) . (36) 

Note that over the course of the simulation some weights will grow very rapidly due to the 
exponential nature of the weight product. As a result, in the long-time limit, the weight of a single 
walker will exponentially dominate the rest. It is therefore necessary to introduce importance 
sampling also for the weights and adjust the walker population to improve the efficiency of the 
reweighting. One possible way how to accomplish this is stochastic branching and elimination 
of walkers. Consider a walker evolving from R'„, to Rm according to Goi^m ^ R'n)- The 
weight for the step becomes Wm = G_b(Rto R-m)- We can now calculate rim = iiit[?7 + 
w], where is a uniform random number from the interval [0, 1], and int is the integer part. 
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The resulting integer rim is the the numher of walker copies placed at the position R„, and 
subsequently evolved as independent walker(s). In case rim = 0, the walker is eliminated from 
the ensemble. In this manner, the branching/eUmination step enables to keep all the weights 
= 1, = 1, Af throughout the simulation. More elaborated approaches for sampling the 
weights can be found elsewhere [39]. 

After an equilibration period, we can start to collect the statistics needed for the calculation 
of the projected ground state energy 

_ /^*(R,r)?^^T(R)dR 

J^DMC = hm ^.T, /T-.N ') 



M 

or other quantities. 



/«'*(R,t)*t(R) dR 
^ Jf{R,T)EL{R)dR 
r'^ //(R,r)dR 

-^^i^4R„) + 0(l/VM) 



2.3.3 Fixed-Node Approximation 

For simphcity, we assume that the Hamiltonian has the time reversal syrmnetry (i.e., no magnetic 
field, induced currents, etc) so that the states can be chosen real. For bosonic systems, the ground 
state wave function has no nodes and is positive everywhere. However, the fermionic wave 
function is by definition antisyrmnetric, and therefore can no longer be represented by the walker 
distribution, which has to be non-negative everywhere. Clearly, for fermions, the wave function 
will have both positive and negative amplitude domains defined by the nodes — subsets of the 
configuration space where the wave function vanishes, i.e., \E'(R) = 0. (The nodal properties of 
wave functions will be studied in detail in the subsequent section.) 

We could try to overcome this problem by decomposing ^(R, r) into positive functions 
(R, r) and (R, r) and assign each walker a positive or a negative weight accordingly 

*(R,r) = *+(R,T)-*-(R,r). (38) 

However, since the Schrodinger equation is linear, both (R, t) and (R, r) independently 
converge to the same bosonic distribution, hence making the fermionic component to decay in 
time (see Fig. 1). 

This is a well-known fennion sign problem where the fermionic "signal" disappears expo- 
nentially quickly in the projection time. The method is possible to use providing one can esti- 
mate quantities of interest before the efficiency deteriorates, nevertheless, for large systems the 
method becomes very inefficient. The inefficiency can be eliminated at the price of introducing 
the so-called fixed-node approximation [36, 40-42], which restricts the evolution of positive and 
negative walkers within the separate regions defined by the sign domains of the trial wave func- 
tion 'I't(R). This is achieved by enforcing the mixed distribution /(R, r) = ^t(R-)'S'(R, t) to 
be non-negative everywhere 



/(R,t)>0. 



(39) 
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x=0 




Fig. 1. Imaginary time beliavior of the walker distributions ^' (red) and 'I'" (blue). Upper figure: at the 
beginning of the propagation (r = 0). Lower figure: after some imaginary time evolution. The fermionic 
signal decays as (^f^ — ^f") oc exp[— r(i5o^ — Eq)], where Eq — Eq is the difference of fermionic and 
bosonic ground state energies. 



In other words, ^'(R, r) shares the same zeros and sign structure as ^t(R-) for all r, so that 
Eq. (39) holds. This is an important conceptual change in the sampling process: the fixed-node 
approximation replaces the (nonlocal) antisymmetric condition by the local boundary condition 
in Eq. (39). Note that the drift term diverges at the node 4't(R) = and therefore repels the 
walkers away from the nodal regions. The fixed-node approximation has several appealing prop- 
erties. The change of symmetries into boundaries effectively bosonizes the fermionic problem 
and enables us to apply the statistical machinery to solve the evolution equation. The sketch of 
the fixed-node DMC in Fig. 2 reminds us Feynman path intergrals and the walker paths really 
are closely realated to quantum paths except that they happen in imaginary instead of real time. 

The fixed-node solution provides the lowest possible energy within the given boundary con- 
ditions and as such is a rigorous upper bound to the true ground state energy [36, 42]. This result 
also implies that resulting fixed-node energy bias is proportional to the square of the nodal dis- 
placement error and in actual applications the method has proved to be very successful as further 
elaborated later. 

The basic form of the fixed-node DMC method is conceptually rather straightforward. Let 
us therefore sketch a simple version of the algorithm. We assume that 4't(R) approximates the 
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desired groimd state and conforms to the required symmetries. The algorithm is as follows. 

(1) Generate a set of walkers {'Rm}m^i distributed according to for example, by using 
the VMC method. 

(2) Evaluate the drift v(R) = Vr In |*t(R)| for each walker. 

(3) Evolve each walker from its position R' to a new position R according to 

R R' + Atv(R) + X, (40) 

where x is a 3N-dimensional vector with components drawn from the normal distribution 
with zero mean and variance At. 



(4) Accept the move with the probabiUty 
P„ec(R ^ R') = min 



' Gi,(R'^R)*2.(R/)''-' 



(41) 



where ,s = sign[5'y(R)] and s' — sign[^7-(R')]. 

(5) For each walker calculate the number of copies positioned at R as 

n(R) = mt{r] + exp[-r(^i(R') + El(R) - 2Et)/2)]}, (42) 
where rj is drawn from a uniform random distribution on the interval [0, 1] . 

(6) Evaluate averages of interest over the ensemble of walkers and update Et as 

EriT + At) = Erir) - Cad, HMact/Ma,,), (43) 

where Mact is the actual walker population, Mave is the desired walker population and 
Cadj is a positive constant, which tunes how quickly the population would reach the desired 
value. The constant is typically chosen to rebalance the population to the desired number 
in a few steps, say, for 10 steps its value is « 10/t. 

(7) Repeat the steps (2) to (6) until the error bars of the calculated quantities reach the desired 
values. 

Note that in the step 4) we correct the short-time Green's function approximation in two ways. 
Besides the acceptance step explained previously, the fixed node approximation is enforced for 
the case when a walker violates the nodal boundary. This occasionally happens, again, due to the 
Green's function inaccuracy. Since the evolution is discretized, the inherent fluctuations in the 
diffusion step can cause the walker to cross the nodal boundary — although for the exact Green's 
function the probability of such an event vanishes. As mentioned above, the time step At is 
usually chosen to be small enough so that the acceptance is 99% or higher. Any remaining time 
step bias can be eliminated by extrapolating At — > from repeated simulations with decreasing 
At. 
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Fig. 2. On the left is a sketch of the fixed-node diffusion Monte Carlo method. The first excited state of ID 
harmonic oscillator serves as a fermionic ground state toy model. Walkers are generated from a constant 
initial distribution and are divided into "plus" and "minus" populations according to the trial function (in 
ID the node is a point). Enforcing the fixed node condition implies that the two populations never mix. The 
plot on the right shows walker diffusion and drift steps close to the node which is depicted by the dashed 
line. 



In order to clarify the fixed-node DMC scaling of the computational time in the system size 
we invoke the previously given formula Eq.(20). We assume a requirement of the fixed error 
bar e for the total energy. The growth of variance cr^ is basically proportional to the number of 
electrons N, since for large systems fluctuations from different electrons will be approximately 
independent, so that cx N. The time Tgampie then includes evaluations of the wave function 
and operators — such as the kinetic energy, etc — and we can write for Tsampie = C2N'^ + c^N^. 
The quadratic component comes from the pair terms in the Jastrow factor and also from filling 
the Slater matrices, while the cubic part is from evaluating the determinant. For systems with 
up to a few thousand electrons C2 >> C3 so that the quadratic term dominates. There is also 
some cost associated with an increase in statistical correlation of samples which we estimate as 
being of the order of TV. The overall scaling then looks like N'^. However, this appears to be a 
rather conservative estimate. For larger systems one can recast the one-particle orbitals into the 
Wannier orbitals which are localized and also impose sufficiently loose cut-off on the range of 
the Jastrow correlations. That makes the Slater matrix sparse [6] and basically eliminates one 
power of N. If we are interested in intensive quantities such as cohesion (energy per particle) that 
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eliminates another factor of N. For some energy differences correlated sampling shows similar 
improvements and so forth. The algorithms and implementation has a significant impact here 
and in practice the fixed-node DMC calculations scale approximately as where 2 < p < 3. 



2.3.4 Beyond The Fixed-Node Approximation 

The fixed-node approximation was has been an important step on the way to overcome the fun- 
damental difficulties with fermion signs. Besides improving the nodes with more accurate trial 
wave functions (see Sec. 4 and 5) or attempts to understand them (see Sec. 3), several ideas how 
to reach beyond the limits of fixed-node approximation were proposed and tested. 

Ceperley and Alder [43] developed the released-node method. As the name suggests, the 
walkers are allowed to sample the regions of opposite sign by using the guiding distribution, 
which is positive everywhere including the nodal regions. The method suffers from the fermion 
sign problem but enables to somewhat better control the loss of the fermion signal by appropriate 
choice of the guiding distribution and other tune-ups. The algorithm produces transient estima- 
tors so that the decay of fermionic excitations has to be faster then the decay of the fermionic 
ground state component. Let us explain the basic idea of the release node approaches. The 
fixed-node condition is eliminated and instead of '^t the importance sampling is carried out 
with another importance distribution \E'g which is sometimes called the guiding wave function. 
UnUke "^t, which is antisymmetric, "ia is symmetric and positive in the whole configurations 
space, allowing thus the walkers to reach any point. The DMC simulation then runs according to 
the importance sampled Eq.(33), where '^t is replaced by '^q- The resulting release node '^rn 
solution is found from f{R, r) = ^r^'^g- The release node solution is transient and can be 
quaUtatively expressed as 

*Kiv « c{t)^f{t) + (1 - c{t))^b{t), (44) 

where c(r) « exp[— — Eb)t\ and indices F and B correspond to fermions and bosons, 
respectively. The energy of the fermionic component can be evaluated from 

^ //(R,r)||i;^(R)dR 
//(R,r)|5dR 

E 'I't(Rto) IT} \ 



where a cx (Ep — Eb)- Each weight Sm carries a sign which is assigned initially as Wm{T = 
0) = sign[5'T(Rm)] and kept constant during the simulation. A simple guiding function can 
be written as = a/^*^ -I- a, where the parameter a controls the average rate of walkers 
passing into the opposite sign region. More sophisticated forms are necessary for inhomogeneous 
systems. The transient samphng efficiency can be measured by the ratio 



s (46) 
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which, ideally, should be close to unity (as is the case for the fixed-node algorithm when = 
I^'tI). As the simulation proceeds, the ratio decreases and the fermionic signal decays while 
the statistical noise grows. Despite this, the method can be useful providing one can reach the 
fermionic ground state estimator before the error bars become too large. In fact, the release 
node method was used for the famous calculations of the homogeneous electron gas correlation 
energies [23] and for small atoms and molecules later [43]. Note that the simplicity of the 
homogeneous electron gas enabled to apply this method to hundreds of electrons already in 
1980. Unfortunately, applications to inhomogeneous systems are much more difficult and have 
progressed at much slower pace. 

Another idea advanced by Kalos and collaborators is based on the cancellation of paired 
walkers with opposite signs [44]. This method has been subsequently improved to work with 
fuUy interacting ensemble of walkers [45]. The method delays the fermionic decay and enhances 
the fermion solution component, however, the current understanding is that the scaling appears 
to be exponential. An important improvement is, however, that the exponent is smaller than the 
difference between the bosonic and fermionic energies and therefore the technique has a potential 
for efficiency gains in some applications. [46]. 

Anderson and Traynor [47] combined some of the ideas in the fixed-node, released-node and 
cancellation methods in an algorithm which employed improvements such as relocation after 
the node crossing, self and multiple cancellations, and maximum use of symmetry. Among the 
systems of interest, the authors calculated the excited state of H2 '^Sj and the barrier height for 
the simplest possible chemical reaction reaction H -h H2 — > H2 -|- H [48]. The accuracy of this 
essentially exact calculation was exceptional with resulting statistical uncertainty of only 0.01 
kcal/mol. 

Among other alternative attempts, let us also mention the method of adaptive nodes (see, e.g., 
Ref. [49]). The key idea is based on representing the approximate nodal function directly by the 
walker ensemble (e.g., by positioning a Gaussian on each walker). This results in the adaptive 
description of the nodes that does not depend upon a priori knowledge of the wave function. The 
main difficulty of the method, however, comes from scaling to larger systems since one basically 
has to populate the whole configurations space with walkers. 

In simulations of quantum problems the fermion sign problem is pervasive. It appears in 
various incamations and determines computational complexity of the underlying problem. It 
has been recently shown [50] explicitly, that QMC simulations of random coupling Ising model 
with fermion sign problem belongs to the class of NP-hard problems, i.e., with non-polynomial 
computational complexity. There are exceptions which enable to eliminate the fermion sign 
problem exactly such as separable Hamiltonians or problems with certain symmetries (the half- 
filled Hubbard model, for example). However, all such known cases are "special" having some 
underlying symmetries which enable to eliminate the fermion signs. It is therefore possible that 
any exact QMC approach with broad applicability belongs to the class of NP-hard problems and 
therefore finding such method might be an elusive and exceedingly challenging, if not impossible 
goal. The next best goal should be therefore based on a more pragmatic view on what does it 
mean to "solve" a given many-body quantum problem. Perhaps the right question to ask can 
be formulated as follows: What is the most optimal scaling of a method which can calculate 
expectations within a prescribed error? (say, energy differences with errors below 0.1 eV, 0.01 
eV and so forth). Here by the error we mean the total error which contains both systematic biases 
from approximations and statistical uncertainties. The answer to this question is a lot easier, at 
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least in a qualitative sense, since already at present the electronic structure QMC methods can 
achieve accuracies of 0. 1-0.2 eV for energy differences with low-order polynomial scaling in the 
system size. Similar progress has been achieved also in simulations in other areas of physics. 

2.4 Trial Variational Wave Functions 

The great advantage of QMC methods is the capability to use explicitly correlated trial wave 
functions. Their quality is the most important factor which influences both the achieved accuracy 
and efficiency. On the other hand, repeated evaluations of (and VvI/t, V^\1/t, etc) is the most 
costly part of QMC calculation and therefore restrict possible alternatives for to forms which 
can be evaluated rapidly and scale with the system size. 

2.4.1 Basic Properties 

^j/r has to be antisymmetric with respect to exchange of both spin and spatial coordinates 

*t(PR, PS) = (-l)^*r(R, S), (47) 

where S = (cti, crjv) are discrete spin variables with values ±i and P is an arbitrary per- 
mutation with the sign equal to (—1)^. Assuming that the Hamiltonian (or other quantities of 
interest) does not involve spin-dependent terms, any expectation value requires only the spatial 
antisymmetry since the sunnmation over spin variables becomes just a permutation of indices 
inside the same spatial integral. That enables to assign the spins to particular electrons and re- 
quires imposing antisymmetry only with respect to spatial interchanges between the electrons of 
the same spin. The spins of all electrons are therefore fixed to the total spin projection equal to 
^{N^ — Ni). The first Nf particles are labeled as spin-up and the rest N — as spin-down. 
Throughout this paper we assume that the spin variables are factored out by this labeling scheme. 
Using an appropriate expansion in determinants the wave function can be made an eigenstate of 
the total spin 5^ and possibly other integrals of motion such as the total angular momentum 
and Lz in atoms. 

We demand that approximates a given bound state so that / / '^'^'H'^t and 

/ \E'^'H^\I/t do exist. The last condition is necessary for having a finite variance. Further, 
and V^'t have to be continuous, wherever the potential is finite. In addition, the continuity of 
the potential implies also the continuity of V^\I/t- Important corollary is that as *t approaches 
an exact solution, the local energy becomes constant everywhere. We can express this also as 
cr^ = (('H — (W))^) Ofor^'r $o> and it is often called as zero variance /property. Note that 
both VMC and DMC estimators have the zero variance property. (This is not automatically true 
for many QMC approaches, in particular, this does not apply whenever there are non-localities 
in the Hamiltonian. For example, many quantum lattice models such as the Hubbard model are 
nonlocal due to hopping terms.) 

We wish that the local energy is close to the eigenvalue and also that the variance as small as 
possible. Therefore we demand that any singularity from the potential is to be canceled by the 
opposite sign singularity in the kinetic energy, what gives rise to Kato cusp conditions [51, 52]. 
As the distance of electron to nucleus vanishes, vn 0, the potential energy divergence cancels 
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out when 



1 



= -Zt 



(48) 



This electron-nucleus cusp condition [Eq. (48)] can be satisfied by a proper choice of orbitals, 
removed by pseudopotentials (see Sec. 2.5) or enforced by Jastrow factor. Of course, the hy- 
drogenic solutions such as, for example. Is and 2p orbitals given as (Ai, (r) = cxp(— Z/r) and 
4'2p{r) = exp(— Z/r/2)YiTO, have the correct cusp, hi addition, it is clear that any linear com- 
bination of orbitals with the correct cusp has the same property. In particular, if all orbitals in a 
Slater determinant have correct cusps then the same is true for the determinant what is easy to 
verify by an expansion in cofactors. 

Similarly, as the distance rij of any two electrons vanishes, there is a divergence in the 
electron-electron Coulomb potential. Let us first assume that electrons i and j have unlike spins. 
In general, the wave function does not vanish at such a collision point. Close to the singularity we 
can write it as an s-wave in rij times a smooth function which depends on the rest of the variables 
R, i.e., limri^^o^(R) = exp{crij)F{'R). Since there are two contributing Laplacians A, -|- Aj 
we get 



1 a*(R) 

*(R) dnj 



(49) 



For two like spins electrons with r^j — >^ the wave function vanishes. The behavior around 
the collision point has a character of a p-wave. Let us transform the coordinates rj and Vj to 



and r+- 



Ti + Tj. The wave fimction can be then written as Zzmr^ -i.o^(R) 



/(r^j) exp(crij )i^(R). By an appropriate coordinate rotation the linear behavior around r^j — > 



can be expressed as /(r^j ) 
modifies to 



[xi — Xj) +0{{xi — Xj)'^). The like spins cusp condition therefore 



1 a* 



,=0 



(50) 



2.4.2 Trial Wave Functions Forms 

It is convenient to express \E't (R) as a product of an antisyimnetric part and an exponential 
of a symmetric correlation fimction Ucorr{^)> often called the Jastrow factor, as given by 

*t(R) = *a X exp[?7co^^(R)]. (51) 

Let us describe the forms for ^^(R), Ucorr(R) and their generalizations as commonly used 
in calculations (and also implemented in our QMC code QWalk [53]). The Jastrow part [36, 54— 
56] explicitly depends on electron-nucleus and electron-electron distances 

Ucorr{{rij}, {ru}, {rji}) = ^ xirri) + uinj) + ^ w{rij, m, rji) (52) 

il i^j i^j-I 

and is written as the sum of one-body (electron-nucleus), two-body (electron-electron) and three- 
body (electron-electron-nucleus) terms. These correlation functions describe several effects. 
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First, the form captures the electron-electron cusp conditions [Eqs. (49) and (50)]. As derived 

above, these are satisfied providing 



du„ 



1/2 (^3) 

i/2 (Ji — —CTj, 



and while all the remaining functions have zero derivatives for rij = 0. Second, the terms which 
depend on electron-ion distances enable to vary the electron density which needs to be adjusted 
once the electron-electron terms are included. Third, the triple electron-electron-ion correlations 
enable to decrease the correlation effects and decrease the local energy fluctuations. Expanded 
in the basis of one dimensional functions the components of the Jastrow factor take form 

X{r)=^crak{r), (54) 

k 

u{r) = fcuspir) + cl%k{r), (55) 

fc 

w{rij,ru,rji) =^c%'ili^[ak{rii)ai{rji) + ak{rji)ai{r^i)]bm{rij). (56) 

klm 

The above Jastrow factor has proved to be very efficient in describing significant portion of corre- 
lation effects with small number of variational parameters. More details about the implemented 
basis functions can be found below or in the QWalk documentation [53J. 

The simplest antisymmetric functional form is the determinant so that one can expand the 
wave function in spin-factorized Slater determinants as 

*i'"*-(R) = Y^dk det[. . . , iflivi), . . .]det[. . . , y^jirj), . . .] (57) 

fe 

where i = 1, . . . , Af^ and j = N-^ + 1, . . . , N). Each spin-up and spin-down Slater determi- 
nant is a function of one-particle orbitals typically found by (post-)HF or DFT methods. In the 
introductory part we have touched upon some of the properties of x^; Slater ^ Although formally 
the Slater determinants provide a complete set of antisymmetric functions, in practice, we are 
limited to use only finite expansions, very often to a single determinant. It is remarkable that the 
single determinant Slater- Jastrow form proved to be very effective for many applications, per- 
haps beyond expectations. It provides surprisingly accurate energy differences such as cohesion 
and binding, excitations, etc, due to partial cancellations of the fixed-node errors. 

On the other hand, multi-determinant expansions which are used to improve the fixed-node 
energies require rather involved and technically difficult processing such as the following tasks: 

(a) Large determinantal expansions often need computationally expensive reoptimizations of 
the determinantal coefficients. At present, with such wave functions we are thus boimd to 
small or intermediate sizes of chemical systems. 

(b) For very high accuracy calculations one-particle orbitals need to be reoptimized. Typi- 
cally, orbitals come from methods which do not incorporate the explicit correlations and 
therefore are not optimal with Jastrow correlation factor included. However, full optimiza- 
tion of orbitals in many-particle systems is still a formidable challenge. What has been 
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successfully tried is to parameterize the effective one-particle Hamiltonian such as hybrid 
DFT functionals and find optimal orbitals in such restricted parameter space. 

The above limitations motivate to think about possibilities beyond Since the determi- 

nant is the simplest antisymmetric form constructed from one-particle orbitals, it is interesting to 
ask what is the simplest antisymmetric object built from two, three or n— particle orbitals ? For 
two particle orbitals such antisymmetric form is called Pfaffian and the resulting Pfaffian pairing 
wave function is written as 

pf,(l, ...,N)= A[4>{h 2), (^(3, 4), . . . , 4>iN - 1, N)] (58) 
= pf [^(1, 2), ^(3, 4), . . . , 0(iV - 1, N)], (59) 

where A is the antisynraietrizing operator and the arguments of the pairing spin-orbital (f){i,j) 
denote both spatial and spin variables. Note that in the most basic form the Pfaffian contains only 
one pair orbital unlike the Slater determinant which requires as many independent orbitals as is 
the number of particles. 

Similarly to multi-determinantal expansions, we can expand the wave function in a linear 
combination of Pfaffians 

= Y^Wm Pf„(. . . , 4>m{i,j), . . .). (60) 
m 

Note that each Pfaffian in the expansion (60) contains different pairing orbital. The form will 
be later simplified to spatial antisymmetries only such as in the case of determinants by em- 
ploying spin-singlet and spin-triplet pairing orbitals. The description of Pfaffians together with 
applications to several systems and possible extensions is discussed in more detail in Sec. 4. 

Another way how to boost the variational freedom of trial functions is based on the so-called 
backflow coordinates. The idea goes back to Feynman and has been employed for description 
of homogeneous systems [55, 57-62]. Feynman suggested that correlations in quantum liquids 
can be approximately captured by replacing the actual coordinates R in *a(R) by quasi-particle 
coordinates X which take into account the influence from the rest of the particles. As a con- 
sequence, the nodes and other properties of ^'^(X) will be, in general, different from those of 
^^(R). Recently, some progress was also reported in applying this idea to chemical (inhomo- 
geneous) systems [63, 64]. In this review we report some of the results we obtained with these 
new developments. The quasi-particle coordinate of the i-th electron at position rj is given as 

Xi = ri+^.(R) 

= r, + ^r(R) + C(R) + (61) 

where we have again divided the contributions to one-body (electron-nucleus), two-body (electron- 
electron) and three-body (electron-electron-nucleus) backflow terms. They can be further ex- 
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pressed as 

C(R) = Ex(^i/)ri/ (62) 
$r(R) = Eu(ri,)ri,- (63) 

^««"(R) = EE[w;i(rij,ri/,rj-7)rij- +u;2(r-ij,n/,rj-7)ri/], (64) 

where Vij = — v.j. x, u and w;i with W2 are similar to one, two and three-body Jastrow terms. 
The calculational and implementation details together with some results are further discussed in 
Sec. 5. 



2.4.3 Basis Functions for the Jastrow Correlation Factor 

The one-particle orbitals and also the correlations has to be represented in an appropriate ba- 
sis functions. There is a vast amount of literature on basis sets for expressing the one-particle 
orbitals. Typically, the localized basis sets are based on Gaussians, while for periodic systems 
plane waves are frequently used. Other options exist such as numerical orbitals, naixed basis, 
etc. We have little to add to this technically involved subject except that QWalk is able to import 
several types of these. We refer the reader to the literature or manuals for the actual packages. 

However, we briefly outline the basis which is used for correlations in the Jastrow factor. The 
exact electron-electron cusp conditions are satisfied by the choice of foUowing cusp function [see 
Fig. (3)] with variable x = r/rcuu where Vcut is some cutoff radius and 7 is the curvature as 

The cusp constant is c = | for electrons with like and c = ^ for electrons with unlike spins. 

Polynomial Pade functions for the same variable x = r/rcut and curvature /3 had proved to 
be a convenient choice for describing the electron-electron and electron-nucleus correlation. In 
calculations, we use the form 

a.(r), hM = fpoiy-Paaeix, P) = ^ ^ _ ^ 3^2) (66) 

The fpoiy-Pade{0) = 1 with derivative fpoiy-Padei^) = ^l^o goes smoothly to zero 
as r ^ Tcut [see Fig. (4)]. These conditions are necessary for preserving the cusp conditions 
already fixed by the cusp functions and by the choice of orbitals or pseudopotentials. Polynomial 
Pade functions are then used as basis functions in expansions for correlation part in the Jastrow 
factor, for example, for the electron-nucleus part we have 

X{r) = ^crak{r), (67) 



where Ck is an expansion coefficient and basis functions Ofc (r ) with different k differ in curvatures 
^fe. Similarly for other components as given by Eq. (67). 
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Fig. 3. Cusp functions with smooth cut-off for two different values of the curvature parameter (7=1 and 
7 = 10) for hke (c = |) and unUke (c = |) spins. 

2.5 Pseudopotentials 

To a high degree of accuracy, most properties of real systems are determined by the states in 
valence electronic sub-shells only. Atomic cores are also expensive in QMC simulations since 
the computational cost increases with the atomic number Z as « (see Ref. [65, 66]) and 
makes the all-electron calculations unnecessarily difficult for heavy atoms. The presence of core 
electrons complicates calculations for several reasons. The short length scales associated with 
atomic cores states require decrease in the QMC time step. Even more difficult problem is the 
increase in fluctuations of the local energy as Z gets higher due strongly varying kinetic and 
potential terms close to the nucleus. This implies also basic efficiency issue since almost all 
computational time would go to sampling energy fluctuations in the core instead of the important 
valence effects. 

In analogy with theories such as Hartree-Fock and DFT, in QMC the atomic core states can 
be eliminated by introducing an pseudopotential (or effective core potential) operator which rep- 
resents the impact of core on valence states. The action of pseudopotential is typically different 
for states with different angular momenta. It is customary to divide the pseudopotential operator 
into local (common to all angular momenta) and nonlocal (different for each angular momentum) 
parts. For a given ion we can therefore eliminate the core states and replace the ionic term as 
follows 

— ^ UP' ^ yP'ir) + (68) 
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where V^" (r) is the local part while 

^nzoc ^ J2 yP'ir) J2 \lm){lm\ (69) 

/ m 

is the nonlocal pseudopotential with projectors over Im states of angular momentum. The non- 
local portion of the local energy i?/,(R) from the electron i in the field of the ion / is given 
by 



S^.rpsf ^2^ + l f , *T(ri, ■ • ■ ,r',, . . . ,rjv) 

, 47r WT(,ri, . . . ,ri, . . . ,rArj 



(70) 



where Pi denotes the Legendre polynomial of degree I. The angular integral above is over the 
surface of the sphere passing through the i-th electron and centered on the ion. The integral is 
evaluated numerically using Gaussian spherical quadrature grids. When the orientation of axes 
of a given quadrature are chosen randomly, the result is an unbiased Monte Carlo estimation of 
the given integral (70). For more details, see original Ref [67]. 

In the DMC method, the application of the nonlocal operator W"^°'^ causes the matrix el- 
ement (R| cxp[— Ariy"'"'^] |R'), and therefore also the propagator, to have a fluctuating sign. 
This would create its own fermion sign problem even for a single electron. Consequently, the 
imaginary time Schrodinger equation for the function /(R, t) ~ 5't(R)^(R, t) is rearranged 




Fig. 4. Polynomial Fade functions with curvatures P ranging from -0.99 to 100. 
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as given by 

_dfi^ = - ^VV(R,r) + V • [vb(R)/(R,t)] + [El(R) - i;T]/(R,T) 

/ ynioc^T(R) T4;oc^(R, r) I 
^\ *t(R) *(R,r) J-^^^'^^' ^'^^ 

where y„/oc denotes the sum of all nonlocal pseudopotential components from all ions. Note 
that, unfortunately, we do not know how to calculate the last term since it contains the solution 
which is a priori unknown. We therefore introduce the localization approximation by neglecting 
the last term in Eq. (71). It has been shown [67] that the error in energy from the localization 
approximation cx — $o)^- However, since the Hamiltonian is modified due to the neglected 
term, the obtained energy is not necessarily an upper bound to the true eigenenergy of the orig- 
inal Hamiltonian. Fortunately, accurate Jastrow-Slater trial wave functions enable to carry out 
calculations without appreciable impact from the localization error. Calculations have shown 
that the localization approximation bias is typically smaller then the fixed-node bias [68, 69]. 
Therefore this approximation proved to be less of a problem in practice since the localization 
error is overshadowed by the fixed-node error. The issue of the upper bound has been resolved 
some years ago once ten Haaf, van Bemmel and co-workers [70, 71] suggested modification of 
Hamiltonians with nonlocal terms which does not violate the variational principle. Let us write 
the matrix element of the nonlocal part as 

(R\Vnioe\n') = (R\V+jn') + (R\V-,JR') (72) 

where {R\V^i^JR') are matrix elements for which (R|1/+^^|R')'I't(R)4't(R') > and vice 
versa for (R|V17„„|R')- For small times we can write 

* ' ft too ' ' 

(R| - ArVniocln') = 5(R - R') - (R|F„/oc|R')At + 0[{ATf] (73) 

so that it is clear that the negative matrix elements do not flip the walker sign and can be sampled 
without introducing any sign problems. The sign flipping (positive) part is evaluated by the 
projection onto the trial function very much like in the localization approximation, i.e., 



= *t(R)-^ j (R|yUR'>*T(R')dR' (74) 
so that the new modified Hamiltonian has a form 

n = Hioc + V+j + j (R|y„7„JR')dR' + F-,,(vI/t(R) = 0) (75) 

where T-Lioc contains all the local parts such as kinetic energies and local interactions including the 
local components of pseudopotentials. The the node term enforces the fixed-node approximation 
explicitly (otherwise V^^j would be unbounded at the nodes). Interestingly, one can explicitly 
show that 

(*T|?i|*T) > (*T|(^ioc + Kioc)|*T) (76) 
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for any '1'^ while the equality takes place when "^t = '^o, i-S-, the energy of the modified 
Hamiltonian is always an upper bound to the original H = Hioc + Vnioc- New algorithmic 
developments by Casula and coworkers [72] enabled to implement this treatment of nonlocal 
operators into the DMC methods as the so-called T— moves. 

Pseudopotentials are also very useful for heavy elements which require inclusion of relativis- 
tic effects. The scalar relativistic corrections can be built-in into the pseudopotentials and for 
very heavy elements this is basically the only choice if one wants to avoid the full relativistic 
treatment. 

Although this section seem somewhat specialized and technical, the results have wider impli- 
cations. Note that the last development enables us to work with any type of nonlocal operators 
using the decomposition from Eq. (72) and the effective upper-bound Hamiltonian from Eq. (75). 
Clearly, this opens possibiUties for simulations with more complicated Hamiltonians, beyond just 
electrons and ions. 



2.6 Optimization of Variational Wave Functions 

The quality of trial wave functions has a significant impact on the efficiency of VMC and DMC 
methods as well as on the accuracy of the final results. The presented functional forms typically 
depend on a number of linear and non-linear parameters which has to be optimized. The opti- 
mization of functions is a classic numerical problem and many approaches can be found in the 
literature. However, in combination with random fluctuations and objective functions defined 
by the finite ensemble of walkers, the problem becomes more comphcated. We will therefore 
outline a few methods used for this purpose. 



2.6.1 Variance Minimization 



Let us denote the set of variational parameters {c} of some real valued trial wave function ^'{c}- 
The mean value of the local energy with respect to is given by 



4tc = rti = (4^^> - Ei'^^- (77) 



The variance of the local energy can be written as 



''{^> - f%ydIL - ^^^^ 



= ((i?r>-^W)2). (78) 



Since the variance directly determines the efficiency one possible approach for wave function 
optimizations is minimize the variance [56] in Eq. (78) using a finite set of fixed configurations, 
where the walkers are distributed according to ^{co}' starting variational parameters is 

denoted as {cq}. The variance optimization is a rather robust strategy since it is always positive 
and bounded from below. In practice, the mean value of the local energy is not a priori known, 
so we replace E^'^^ in Eq. (78) by a reference energy Eref- 

The method can be further improved. One possibility is to use weights which account for the 
wave function change after the parameter updates. The re-weightened variance with the new set 
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of parameters {cnew} is then given as 

where we have introduced the weighting function 

^ic„.„>(R) = -^-^. (80) 

The advantage of the reweighting scheme is a more accurate value of variance at each optimiza- 
tion step. However, once the wave function departs significantly from its original distribution 
one has to update the walker positions as well to reflect the updated wave function otherwise the 
method becomes unreliable. 

For minimization algorithms we use two different methods. The first one is a modified ver- 
sion of a quasi-Newton method [73]. It uses only the values and numerical gradients of the objec- 
tive function (in this case the variance) and builds up the inverse Hessian from updates along the 
optimization trajectory. Second algorithm is based on the general Levenberg-Marquardt (LM) 
method. It can be smoothly tuned between Newton and steepest descent approaches with a built- 
in stabihzation parameter. However, it requires both gradient and Hessian and is therefore more 
costly to calculate. The gradient of the variance with respect to i-th parameter is readily obtained 
by differentiating Eq. (78) as 



iE.AE. - E)) + {^El) - {^){El) -2E-{^iE,-E) 



(81) 



where the subscript i denotes Since the variance minimization method can be viewed as a 
fit of the local energy for a fixed set of configurations [56], by ignoring the change of the wave 
function an alternative expression for the same gradient follows as 

a1=2{EL,i{EL-E)). (82) 

The Hessian derived from the gradient (82) is then given as 

al=2{{EL,i-E){EL^^-E)). (83) 

An important property of this approximate Hessian (83) is that it is symmetric in i and j and 
positive definite even for a finite set of configurations what is not automaticaUy true when using 
the formally exact expression. 



2.6.2 Energy Minimization 

In general, a straightforward minimization of the energy in Eq. (77) can be quite unstable. This 
is somewhat counterintuitive and has its origin in the finite sample. The reason is that for a 
sufficiently flexible variational wave function it is possible to seemingly lower the energy for 
the fixed set of configurations, while, in fact, raising the true expectation value. As we have 
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mentioned earlier, the variance is bounded from below and therefore this problem is not present 
even for the finite sample [56]. Although one can frequently re-sample the wave function so that 
the method is always close to the actual minimum one can do better with some modifications. 

We can improve the method convergence by employing the LM method and information 
about gradient and Hessian of the local energy mean. The corresponding gradient can be readily 
obtained from Eq. (77) as 



E, 



= 2 



2E- 



{El - E) 



(84) 



where we used the fact that H is Hermitian. Note, that the expression in the last step of Eq. (84) 
has a favorable property of zero fluctuations as \1/ approaches the true eigenstate. By differenti- 
ating the Eq. (84) we get the Hessian as 



Ei, 



$2 



{El 



Ei + 



E) 



'-E 



(85) 



It is clear that the above Hessian is not symmetric in i and j when approximated by a finite 
sample and the expression leads to quite large noise. Umrigar and Filippi [74] have recently 
demonstrated that the fully symmetric Hessian written entirely in the terms of covariances {{ab) — 
(a) (&)) has much smaller fluctuations then the Hessian from Eq. (85). The modified Hessian from 
Ref . [74] is given by 



Ei. 



+ 



\I/2 



{El-E))- 



\ * 



-E 



+ 



EL,i)- 



{EL,i) 



(86) 



where we have added three additional terms with vanishing expectation value (for proof, see 
e.g. Ref. [75]). The terms make the Hessian symmetric while at the same time decreasing the 
variance. Hence, addition of terms with zero expectations helps to cancel most of the finite 
sample fluctuations and makes the minimization method vastly more efficient. 
Another useful rearrangement of the Hessian (86) is 



Eij =2 



^2 



\ * 



+ 



E 



{Elj 



+ 



{El 



E) 



Er 



{El 



(87) 



If the minimization procedure involves only the linear parameters in the exponential (i.e., of the 
form exp[c(i)/]) such as hnear coefficients in the Jastrow factor, the two terms on the first Une 
of Eq. (87) cancel out, eliminating thus the need for evaluation of the term. For details on 
Levenberg-Marquardt method and other minimization see, for example, Ref. [76]. 
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3 Nodal Properties of Fermionic Wave Functions 
3.1 Introduction 

For a long time, nodes of fermionic wave functions and also related nodes of density matrices 
were considered uninteresting, since they are very complicated and their usefulness or connection 
to physical effects were unclear. For a given real state *(R), the node is defined as the subset 
of R-space for which \I/(R) = 0. The node divides the configuration space into nodal domains 
with constant wave function sign, "+" or Interestingly, the first ideas on nodal properties 
of eigenstates go back to Hilbert and later to Courant. In particular, Courant proved the famous 
theorem that the n— th eigenstate of the 2D Laplacian on simply connected convex 2D regions 
with Dirichlet or Neumann boundary conditions have n or less nodal domains. Later this has been 
further expanded into higher dimensions and more complicated Hamiltonians. In the context of 
physics problems, Breit analyzed the nodes of two-electron states in 1930 and he actually found 
the first exact node for the atomic ^P(2p^) state [77|. More recently, the behavior of nodes 
for highly excited (quasi-classical) states were studied and various bounds on nodal (Hausdorff) 
volumes for n — >^ oo were established. As we have seen, the nodes play an important role in 
eliminating the fermion sign problem and knowledge of exact nodes would enable us to obtain 
fermionic ground states essentially exactly. 

In relation to QMC methods, the nodes were perhaps for the first time encountered and identi- 
fied as a "problem" in the paper by Anderson in 1976 [41]. The fermion nodes of small systems, 
mostly atoms, were investigated in several other papers [12, 40, 78-80]. The fermion nodes for 
degenerate and excited states were further studied by Foulkes and co-workers [81]. Bressanini, 
Reynolds and Ceperley demonstrated differences in the nodal surface topology between Hartree- 
Fock and correlated wave functions for the Be atom and explained thus the large impact of the 
2s, 2p near-degeneracy on the fixed-node DMC energy [82]. More recently, improvements in 
fixed-node DMC energies of small systems were studied by using CI expansions [83, 84], and 
pairing wave functions [1,2, 85, 86]. 

Some of the general properties of fermion nodes were analyzed in an extensive study by 
Ceperley [87], which included proof of the tiling and connectivity properties as well as gener- 
alizations of the notion of nodes for density matrices. From few-electron systems as well as 
from numerical studies of free particle electron gas in 2D and 3D [87] came the conjecture that 
fermionic ground states have only two nodal cells (one "+", one "-"). The paper laid down some 
of the ideas, however, explicit demonstrations were missing even for non-interacting systems. 
Explicit proofs of this conjecture for several important cases was found later [88, 89], when 
one of us used the property of connectivity to show analytically that a number of spin-polarized 
non-interacting and mean-field systems (non-interacting homogeneous electron gas, HF atomic 
states, harmonic fermions and fermions on a sphere surface) have ground state wave functions 
with the minimal number of two nodal cells [88, 89]. Furthermore, using the Bardeen-Cooper- 
Schrieffer wave function, it was shown that for in spin-unpolarized systems (i.e., with both spin 
channels occupied) an arbitrarily weak interaction reduces the four non-interacting nodal cells 
again to the minimal two. Finally, in the same paper was also demonstrated that the minimal 
number of nodal cells property extends to the temperature density matrices. 

From the point of view of QMC methods, the two key issues are the nodal topologies and the 
nodal shapes and their efficient description. The topologies are important since they determine 
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how the configuration space domains are connected, property crucial for the correct sampHng 
process. In addition, the topologies are also related to physical effects such as multiple particle 
exchanges and pairing. The shapes are important for the accuracy and so far our abiUty to de- 
scribe these with sufficient flexibiUty is quite linnited. Some recent advances on this aspect will 
be presented later in the review. 

This section is organized as follows. In Sec. 3.2, we outline some of the general properties 
of fermion nodes. In Sec. 3.3, exact fermion nodes are described for few-electron spin-polarized 
atomic states, including the only known three-electron case discovered very recently. In Sec. 3.4, 
we categorize the nodal surfaces for the several half-filled sub-shells relevant for atomic and 
molecular states. In Sec. 3.5, we show how opposite spin correlations eliminate the redundant 
nodal structure of HF wave functions for cases of spin-unpolarized states. 

3.2 General Properties 

Let us consider a system of N spin-polarized fermions in d dimensions described by a real 
antisymmetric wave function ^(R). We assume that ^'(R) is an eigenstate of a Hamiltonian 
which, for simplicity, contains only local interactions. Consequently, there exists a subset of 
electron configurations called a fermion node, for which the wave function vanishes so that we 
can specify the nodal subset by an implicit equation 

*(R) = 0. (88) 

Assuming that \E'(R) describes the ground state, from the nodal subset we omit regions where 
the wave function might vanish because of other reasons (e.g., external potential, additional sym- 
metries, etc). Note that when we talk about the nodes, we have in mind the nodes of many-body 
wave functions in the space of all particles. These nodes have nothing to do with familiar nodes 
of one-particle orbitals such as the angular nodes of spherical harmonics Yi^ or radial nodes of 
radial one-particle orbitals. In general, the many-particle state node is an {Nd — 1) -dimensional 
hypersurface defined by the implicit Eq. (88). Whenever positions of any two electrons coincide 
(i.e., Tj = Yj), the antisymmetry ensures that the ^(R) = 0. For d > 1, these coincidences 
do not fully specify the nodes, but only define {Nd — rf) -dimensional subspaces of coincidence 
planes. As we will see, the coincidence planes will play an important role for the nodes in ID 
systems. 

Another definition which we will need is the nodal domain. The node divides the configu- 
ration space into subsets in which the wave function sign is constant and which are called nodal 
domains (or cells). The nodal domain definition can be reformulated also in terms of paths in the 
configuration space. A nodal domain fi(Rt) is a subset of configurations which can be reached 
from the point Rj by a continuous path without crossing the node. 

Let us now introduce the basic properties of wave function nodes. 

(a) Manifold almost everywhere — Uhlenbeck [90] proved that the node is generically a man- 
ifold, i.e., it can be locally mapped onto a {dN — 1) -dimensional plane except, possibly, 
for a subset of zero measure. This assumes reasonable Hamiltonians and a few other gen- 
eral conditions. Note that this does not preclude two or more hypersurfaces to cross since 
crossing is automatically lower dimensional, i.e., of zero measure. 
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(b) Nodal crossing angles — If two nodal surfaces cross each other, they are orthogonal at the 
crossing. If n nodal surfaces cross each other, the crossing angles are all equal to n/n. 

(c) Symmetry of the node — The symmetry of the node cannot be lower than the symmetry of 
the state. Typically, these symmetries are the same, however, there are exceptions. There 
are known exact cases with the node synmietry being higher than the state synmietry. 

(d) Tiling property for non-degenerate ground states — The tiling property says that by ap- 
plying all possible particle permutations to an arbitrary nodal cell of a ground state wave 
function one covers the complete configuration space (i.e., n(PRt)-|-n(Rt) = fH). 
Note that this does not specify how many nodal cells are there: the number of nodal do- 
mains can be any even number between two and A^!. We will use this property later. 

Let us analyze the nodes for several models. It is instructive to start with ID systems for 
which the exact ground state nodes can be found explicitly and for arbitrary interaction. The 
reason is that in ID the coincidence and the nodal hypersurfaces are identical due to geomet- 
ric constrains of ID space. Let us first consider TV spin-polarized fermions in a ID harmonic 
oscillator well. The wave function is given by the Slater determinant 

N) = det[0fe(r,)] = A n e""' /'det[l, 2x, Hn-i{x)], (89) 

i 

where Hn{x) is a Hermite polynomial of degree n and A is the normalization. We omit the 
prefactors and transform the Slater matrix to monomials so that the wave function is given by the 
Vandermonde determinant 

*ic(l,...,A^) =det[l,ar,ar^...,ar^-l] = l[{xj - Xi). (90) 

The node is encountered whenever two fermions pass through each other and the wave function 
has N\ nodal cells, since any permutation requires at least one node-crossing. In general, the 
derived node is exact for other ID models including systems with arbitrary local interactions 
(which are not too singular). For periodic boundary conditions one can find a similar expression. 
Let us consider N = {2kF + 1) particles in a ID periodic box (— tt, tt), so that the coordinates are 
dimensionless. In this setting the Fermi momentum becomes an integer, kp = 1, 2.... The one- 
particle occupied states are written as (l>n{x) = e*"^, n = 0, ±1, ikp and the spin-polarized 
ground state is given by 

*iz5(l,...,7V) =dct[0„(xj)], (91) 

where xj is the j-th particle coordinate and j = 1, A^. The term exp{—ikF J2j ^j) 
factorized out of the determinant so that the Slater matrix elements become powers of zj = e'^^ . 
This leads to the Vandermonde determinant and after some rearrangements we find 

..•,N) = e-'^^^o-i ^{zj - Zk) = A JJ sin(a;,fc/2) (92) 

j>k j>k 

where Xjk = Xj — Xk and A is an unimportant constant prefactor. The periodic boundary con- 
ditions result in the same ID nodal hypersurface which is also manifestly periodic, as expected. 
Again, for ID periodic ground states these nodes are exact irregardless of interactions. 
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Properties of nodes for d > 1 are very different and also far more difficult to analyze than in 
one dimension. The key difference is that the node dimensionality is higher than the dimension- 
ality of the coincidence hyperplanes. Clearly, for > 1 particles have enough space to "avoid 
each other" and that has implications on the nodal topologies and the number of nodal cells. In 
addition, this spatial or dimensional flexibility also means that the nodal hypersurfaces can be 
and are affected by interactions. 

We first consider spin-polarized non-interacting fermions in a 2D harmonic well. The one- 
particle states are simply (pnm = CnmHn{x)Hm{y), n,m = 0, 1, ... where Cnm includes the 
Gaussian and normalization which are absorbed into a common prefactor and omitted. The Slater 
matrix elements can be rearranged to monomials as follows 

^2D{l,-,N)=dct[l,x,y,...,x"y"',...]. (93) 

The closed-shell states and the system size are conveniently labeled by M — 1, 2, ... where 
n + m < M, with the corresponding number of fermions given by = (M + 1)(M + 2)/2. 

Let us illustrate the qualitative difference in nodal topologies between higher dimensions 
and ID systems on the following example of three harmonic particles. For M = 1, the three- 
particle wave function is given by ^'2d(1, 2, 3) = det[l, x, y]. In order to understand the nodal 
domain topology, it is convenient to extend the 2D coordinates by a "dunrniy" third dimension 
as Ti = {xi, yi,0). Then 5'2d(1, 2, 3) = zq • (r2i x rai) where Zq is the unit vector in the third 
dimension and = — rj . Note that while in ID three particle state exhibits 3! = 6 nodal cells, 
the result we just derived implies something very different. There are only two nodal domains 
since the set of vectors Zo,r2i,r3i is either left- or right-handed, and the node is encountered 
whenever the three particles become coUinear, i.e., r^i x r^i = 0. Another important property 
which will be explored in detail below, is the fact that it is possible to carry out a triple exchange 
1— >2, 2^-3, 3^1 without node-crossing (e.g., rotate an equilateral triangle). 

The triple exchanges are important for proving the two-nodal cell property in general cases 
for arbitrary number of particles. In order to explain this aspect, we introduce a notion of particles 
connected by triple exchanges. The three particles i, j, k are called connected if there exist a 
cyclic exchange path i^j,j-^k, k^i, which does not cross the node. Let us now suppose 
that there exist a point R* inside a nodal domain such that all particles are connected into a single 
cluster of triple exchanges (see Fig. 5). Then ^'(R) has only two nodal cells. In other words, the 
whole configuration space is covered by only one positive and one negative nodal cell (i.e., the 
nodal cells are maximal). Let us try to understand why this is the case. First, any triple exchange 
is equal to two pair exchanges. The existence of the point Rj with all particles connected then 
means that all positive (even) permutations P+Rf can be carried out within the same nodal cell. 
Second, the tiling property implies that once all particles are cormected for R^ the same is true 
for entire cell rJ(Rt). Therefore, there will be only one maximal cell per each sign. More details 
on this construction can be found in Ref. [87]. 

We are now ready to show that six particles (M = 2) in the harmonic 2D well also have the 
ground state with the minimal nodal count of two. We position particles 1, 2 and 3 so that they 
have common y— coordinate, t/o- For M = 2 the wave function reads 



*2d(1,-,6) = d.et[l,x,y,x^,xy,y'^] = 
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All-particle 
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space 



Connected 
cluster 
configuration 




Fig. 5. Sketch of a connected cluster of triple exchanges. Each triangle means that there is a triple exchange 
path which does not cross the node. 
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Fig. 6. Positions of particles for proving the connectivity of six 2D harmonic fermions. 
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where the particle configuration which will enable to figure out the connectivity is given in Fig. 6. 
All elements containing yo can be eliminated by appropriate rearrangements and the determinant 
factorizes as 

6 

*2c(l, 6) = Hivi - yo)*ii3(l, 2, 3)*2i3(4, 5, 6). (95) 

i=4 

It is now clear that we can rotate the triangle of particles 4, 5, 6 without crossing the node since 
^2£)(4, 5, 6) remains constant and the wave function does not change the sign. That implies 
that particles 4, 5, 6 are connected. However, the wave function is rotationally invariant so the 
particles 1, 2, 4 must be connected, as well as the particles 2, 3, 5. Therefore all six particles are 
connected and the wave function has only two nodal cells. 

With some effort it is possible to generalize these types of constructions and show that har- 
monic particles are connected for any closed-shell state with arbitrary M — > oo and for arbitrary 
d as given elsewhere [88]. 

It seems that the harmonic oscillator might be somewhat special due to the simple structure 
of the wave function, since the nodes are zeros of antisymmetric polynomials [89]. In fact, the 
minimal two-nodal cells property appears to be generic. Let us present an example which is 
more realistic and does not reduce to simple polynomials. Consider an atomic spin-polarized 
state ^S{ls2s'iip'^), i.e., 5 electrons in the Coulomb —Z/r potential for both non-interacting and 
HF wave-functions. The wave function can be written as 



^at{l,-,^)=dei[pUr),p*^^{r),x,y,z\, (96) 

where p\^{r) = Pu{r)/p2p{r) and p^^ir) = P2s{r) / P2p{r) since non-negative p2p{r) is fac- 
torized out. The coordinates become dimensionless by rescaling with the atomic number Z as 
r ^ Zr. Let us position particle 1 at the origin and particles 2 to 5 on the surface of a sphere with 
the radius rjo equal to the radial node of P2s{'r) orbital, i.e., p2s{vo) = 0. For such configurations 
we obtain 



*at(l, 5) = P*isim)P*2s{0)^32 ■ (r42 X Y52) (97) 

Three-particle exchanges 2, 3, 4, 5 easily avoid node-crossings by appropriate positioning and 
rotations on the sphere, e.g., using positions of the tetrahedron vertices. We can also show that 
particle 1 is connected by the exchange 123 — > 312 which is parameterized as ri (t) — 779 [t , 0, 0] ; 
Y2{t) = ?7o[c(i), s(f), 0] and Y^it) = 7?o[0, 1 — t,Q\ where t = Q {t — \) corresponds to the 
beginning (end) point of the exchange path while c{t) = cos(7rf/2) and s{t) = sin(7rt/2). 
Setting r4 = [0, 0, r/o] and rs = [0, 0, — ryo] we find that ^at is proportional to 

P*2siino)c{t){l -t)+ pU{1 - t)r)Mt)t > (98) 

The inequality holds for the whole path < t < 1 since P2s{t'no) > for < f < 1 for 
both non-interacting and HF cases. Note that the demonstration is non-trivial since the HF wave 
function reflects the mean-field interactions and the orbitals can be found only by a numerical 
solution. Nevertheless, qualitative features of the HF and non-interacting orbitals (symmetries, 
structure of radial nodes, etc) are basically identical and enable to verify the nodal cell count for 
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both cases. Note also that the wave functions is not a multi-variate polynomial like for the case 
of harmonic potentials. The proof can be further extended to more shells with more electrons 
such as Is2s2p^3s3p^ and Is2s2p^3s3p^3d^ [89]. 

Similar two-nodal cell demonstrations can be carried out for many paradigmatic quantum 
models such as 2D and 3D homogeneous fermion gas, fermions on a sphere, etc [89]. In addition, 
the same can be shown also for the temperature density matrices, for which one can define the 
nodes in similar way as for the wave fimctions. 



We assume the usual electron-ion Hamiltonian and we first investigate a few-electron ions fo- 
cusing on fermion nodes for sub-shells of one-particle states with ,s,p, rf, ... symmetries using 
variable transformations, symmetry operations and explicit expressions for the nodes. 

3.3.1 Two-Electron Atomic ^P(p^), ^S{ls2s) and Molecular ^^^(Tr^) IViplet States 

Apparently, the exact node of the atomic ^P{p'^) state was derived in a different context by Breit 
in 1930 [77, 83, 91]. Here we offer an independent proof which enables us to apply the analysis 
to molecular states with the same symmetries. The state is the lowest within its symmetry class. 
It has even parity and cylindric symmetry, say, around z-axis. It is also odd under rotation by tt 
around any axis orthogonal to the cylindric axis. We denote the rotation by tt around, say, the 
x— axis, as R{'kx). In order to uncover the exact node it is convenient to define new coordinates. 
Let us denote r^2 = ri+r2,rj^2 = | 1' together with the customary r 12 = ri — r2,ri2 = |ri2|. 
We can now introduce the following coordinate map 



where cosuj = zq ■ (ri x r2)/|ri x r2| with zq being the unit vector along the z-axis, cos /? = 
''12 ■ ''i2/(?' 12^12)' ™d (fi' being the azimuthal angle of ri x r2. Further, (f is the azimuthal angle 
of in the relative coordinate system with the x-axis unit vector given by a projection of zq into 
the plane defined by ri, r2, i.e., e^; = zop/|zop|, = (ri x r2)/|ri x r2| and Gj, = x Gj;. 
The coordinate ip' can be omitted from further considerations right away due to the cyhndric 
symmetry so that the wave-function dependence simplifies to ^ {r f 2, r 12, cos ^, cos uj,(f). We 
introduce an operator Q which acts on coordinates and is given hy Q = PjPi2R(nx) where Pj 
inverts around the origin and P12 is the pair exchange of particles I and 2. Considering the state 
symmetries, action of Q reveals that the wave fimction is invariant in the simultaneous change 
(cos 13, if) — >■ (— cos P,—ip). Combining action of Q and P12 we find that 



with the rest of variables unchanged. The node is therefore given by cos oj — 0, i.e., Zq • (ri •r2) = 
0, and it is encountered when both electrons happen to be on a plane which contains the z- 
axis. This exact node therefore agrees with the node of non-interacting or HF wave function 
^ = det[p{r)x, p{r)y]. 

The derived node equation is applicable to any cylindric potential with Dcoh symmetry, e.g., 
equidistant homo-nuclear dimer, trimer, etc, with one-particle orbitals tTx, ny which couple into 
the triplet state ^Eg(7rx%). 



3.3 Exact Nodal Surfaces 



(ri,r2) (rj'2,ri2,cosa;,cos/3,</9,<^')> 



(99) 



^(..., — cosa;, ...) = —'!'(..., cos w, ...) 



(100) 
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Note that the parametrization given above leads to the exact node for the lowest two-electron 
triplet state, "^5(1525) which has been known for some time [12, 78]. The spherical symme- 
try makes angles cj and (/? irrelevant and simplifies the wave function dependence to distances 
ri,r2,ri2 or, alternatively, to ri2 , r J*^ , cos l3. Applying P12 leads to 

-*(ri2,r^2.cos^) = *(ri2,r^2>-cos^), (101) 

so that the node is given by the condition cos/3 = 0, i.e., ri — r2 = 0, or, equivalently, rf = 
r^. The last equation clearly shows that the node is generically a quadratic manifold, namely, 
a 5D hyperboloid in 6D space (one recalls that in 3D space the 2D hyperboloid is given as 
± const = x"^ — y^). Again, the nodal domain count is two and the domains correspond to 

ri > r2 or ri < r2. 

For the presented states the exact nodes are determined solely by symmetry and are indepen- 
dent of interactions. That further implies that all the excited states with the same symmetries will 
contain the ground state node although they will exhibit also additional nodal hypersurfaces. We 
will come back to this point in the following subsection. 

3.3.2 Three-Electron Quartet ^5(2^3) State 

Recently, we have found another case of exact node for a three-electron atomic state, namely, the 
lowest spin quartet of S symmetry and odd parity, ^S{2p'^). It is actually very easy to find the 
node by transforming it to the previously analyzed cylindric symmetry. We can always rotate the 
system so that one of the electrons, say, the electron 3, is positioned on the z— axis so that the 
remaining symmetry is only cylindric. However, then the wave function for the remaining two 
electrons is identical to the previous ^P{2p^) state and we can directly write the nodal condition 
as r3 • (ri x = 0. The wave function vanishes when all three electrons appear on a plane 
which contains the origin. 

One can demonstrate the same by defining an alternative coordinate map cos a = rs • (ri x 
i'2)A3|''i X r2|, cos/3 = [(ri x r2) x r^2] •ri2/[|(ri x ^2) x ri"2||r^2l] and 7 being the azimuthal 
angle of in the relative coordinate system given by the following unit vectors, = i" 12/^12' 

= [[(ri X r2) x ri2] x rj'2]/|[(ri x r2) x ri2] x rj'jl and e^, = x e^:. Besides these relative 
coordinates, the three Euler angles fix the orientation in the original coordinate system, however, 
the S symmetry makes them irrelevant. In the new coordinates the node comes out by the action 
of P\2 on 4'(rj^2i '''12, rs, cos a, cos /3, 7) as 

— cos a, cos /3, 7) ~ — cos a, cos ^, 7). (102) 

It is clear that P2?, and P13 would lead to the same nodal condition, assuming one would redefine 
the relative coordinate map accordingly. The node of non-interacting and HF wave function 
"^HF ~ dot[/9(r)a;, p(r)y, p(r)z] again agrees with the exact condition. As mentioned before, 
the combination of symmetries is so restrictive that interactions leave the node invariant and this 
is in fact the case also for the excited states. Therefore all the excited states must contain the 
ground state fermionic node. This is illustrated in Fig. 7 which shows the groimd state (planar) 
node together with additional nodes, as expected for excitations. 

The excitations have therefore special symmetry structure which enables to show that the 
one-particle s channel is absent from the whole spectrum in this synmietry sector. One can 
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Fig. 7. The 3D projected nodes of a few selected excitations for the symmetry adopted CI expansion of 
the *5(p'^) ground state. The exact planar node of the quartet ground state is also possessed by all the 
excitations. The small spheres show the fixed positions of two electrons while the third one is scanning the 
nodal surface. Labels indicate the types of excitations. 

demonstrate this as follows. We expand any state of this symmetry in Unear combinations of 
excited determinants 



where the sum is over the complete one-particle spectrum (including scattering states). Due to 
the selection rules coefficients Cijk vanish for any excitation which contains one or more I = 
orbitals. Consider first states with = 0. That implies that all determinants in this configuration 
must have Ij = If^ otherwise the Clebsch-Gordan coefficients for S symmetry vanish. However, 
then the excitation would be of even parity and therefore such configurations drop off. The only 
remaining possibility is that li = Ij = 0. Such configurations then require that also 1^ — 0, 
however, such states with odd parity do not exist. Therefore the one-particle channel with s 
symmetry is "switched-off" completely. This property was useful for showing that the fixed-node 
calculation of this state provided the exact energy even when using nonlocal pseudopotential with 
s non-locality [92]. 

In order to illustrate how exact and approximate nodal surfaces affect the fixed-node DMC 
results we studied the valence-electron states of the spin-polarized nitrogen cation, some of which 
were analyzed above. The core electrons were eliminated by pseudopotentials [93]. The trial 
wave function was of the commonly used form with single HF determinant times the Jastrow 
correlation factor [5]. As we explained above, the pseudopotential nonlocal s— channel does 
not couple to either the odd parity S state or the even parity P{p^) state so that the nonlocal 
contribution to these states vanishes exactly. 

In order to compare the fixed-node QMC calculations with an independent method we have 
carried out also configuration interaction calculations with ccpV6Z basis [94] (with up to three 
g basis functions), which generates more than 100 virtual orbitals in total. In the CI method 
the wave function is expanded in excited determinants and we have included all single, double 
and triple excitations. Since the doubles and triples include two- and three-particle correlations 




(103) 
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Tab. 1. Total energies (in Hartrees) of N"*", N"*"^ and N"*"^ ions with core electrons eliminated by pseu- 
dopotentials [93]. The energies are calculated by HF, configuration interaction (CI), variational (VMC) and 
fixed-node diffusion Monte Carlo (DMC) methods. 



State 


HF 


CI 


VMC 


DMC 




-5.58528 


-5.59491 


-5.59491(2) 


-5.59496(3) 




-7.24716 


-7.27566 


-7.27577(1) 


-7.27583(2) 




-8.98570 


-9.02027 


-9.01819(4) 


-9.01962(5) 



exactly, the accuracy of the CI results is limited only by the size of the basis set. By comparison 
with other two- and three-electron CI calculations we estimate that the order of magnitude of the 
basis set CI bias is Q.Ol mH for two electrons and « 0. 1 mH. and for three electrons (despite 
the large number of virtuals the CI expansion converges relatively slowly [95] in the maximum 
angular momentum of the basis functions, in our case Imax = 4). The pseudopotentials we used 
were identical in both QMC and CI calculations. 

The first two rows of Tab. 1 show the total energies of variational and fixed-node DMC calcu- 
lations with the trial wave functions with HF nodes together with results from the CI calculations. 
For ^P{p'^) the energies agree within a few hundredths of mH with the CI energy being slightly 
higher but within two standard deviations from the fixed-node QMC result. For ^S{p^) the CI 
energy is clearly above the fixed-node DMC by about 0.17 mH which is the effect of the finite 
basis. In order to illustrate the fixed-node bias in the case when the HF node is not exact we have 
also included calculations for four electron state ^S{sp^) (for further discussion of this Hartree- 
Fock node see Sec 3.4.2 below). For this case, we estimate that the CI energy is above the exact 
value by « 0.3 mH so that the fixed-node energy is significantly higher than both the exact and 
CI energies. Using these results we estimate that the fixed-node error is « 1 mH, i.e., almost 3% 
of the correlation energy. 



3.4 Approximate Hartree-Fock Nodes 

It is quite instructive to investigate the nodes of half-fiUed sub-sheUs of one-particle states with 
higher angular momentum. We will explore the nodal hypersurface projections into the 3D space 
to understand what an individual electron "sees" during the imaginary time evolution. The study 
also helps to classify nodes for various cases of atomic and molecular systems. For the next few 
subsections we will closely follow the derivations from our paper [92]. 

3.4.1 Hartree-Fock Node of ^S{d^) State 
The HF determinant wave function for ^S{d^) is given by 

5 

^HF = JJp(ri)det[222 _ _ y2^^2 _ y"^ ^xz,yz,xy], (104) 

i=l 
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Fig. 8. The 3D projected Hartree-Fock node of ^S{d^) state, which is an elliptic cone (left and right 
pictures). The picture in the center shows a configuration of two pairs of electrons lying on two orthogonal 
planes which contain the origin. This particular node is of lower dimension because of the additional 
constraint on positions of the electrons. It appears as a crossover between the elliptic cones with different 
orientation (left and right pictures). The small spheres show the positions of the four electrons while the 
line denotes the axis. 



where p{ri) is the radial part of the Sd-orbital. Since the radial prefactor is irrelevant it can be 
omitted. The S symmetry allows to rotate the system so that, say, electron 1 is on the z-axis, 
and then the corresponding column in the Slater matrix becomes (2z^, 0, 0, 0, 0). Assuming that 
zi 7^ we can then write the nodal condition as 

det[x'^ — y'^ , xz,yz, xy] = 0. (105) 

Using one of the electrons as a probe (i.e., looking at the node from the perspective of one of the 
electrons) we can find the projection of the node to the 3D space. We denote the probe electron 
coordinates simply as (x, y, z) and expand the determinant so that we get 

{x^ — y'^)'mi + xzm2 + yzm^ + xym^ — (106) 

where are the corresponding cofactors. We divide out mi assuming that it is non-zero (not a 
crucial assumption as clarified below). We get 

(a;^ — y^) + axz + byz + cxy — 0, (107) 

where a = 1712/1711, b = m^/mi, c — mi/ mi. By completing the square this can be further 
rearranged to 

{x - kiy){x - k2y) + z{ax + by) = Q (108) 

with fci.2 = (— c ± Vc^ + 4)/2. Let us define rotated and rescaled coordinates 

u* = -{ak2-b){x-kiy)/{ki-k2) (109) 
V* ^{aki~b)[x-k2y)/{ki-k2) (110) 
w* = z[(afci - b){ak2 - b)]/{ki - fca)^ (111) 
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Fig. 9. The 3D projection of the nitrogen cation ^S{sp'^) Hartree-Fock node (the core electrons are 
eliminated by pseudopotentials). The projected node exhibits two topologies. It is either a planar surface 
deformed by the radial orbital functions at the nucleus or, in certain configurations, the deformation forms 
a small bubble detached form the surface (the picture on the right). The small cross is the location of the 
ion while the small spheres denote positions of electrons. 



so we can write the Eq. (107) as 



Note that this equation has a form which is identical to Eq. (106) with mi = so this represen- 
tation is correct for any mi. After some effort one finds that Eq. (112) is a cone equation (i.e., 
orbital) as can be easily verified by using the following identity 



where u = u* +v* + 2w* , v — {—u*+v*+2w*), w — {u* —v* + 2w*). The 3D projected node 
is therefore an elliptic cone. Note that its equations is the same as rotated and rescaled angular 
part of the (1^2 orbital, — x'^ — y^. 

Therefore the 3D projection of the HP node is a family of cones given by the homogeneous 
second-order polynomial in three variables with coefficients determined by the positions of the 
four electrons. From the theory of quadrics [96], one finds that a general elliptic cone can 
possibly "fit" up to five 3D points/electrons. However, in our case there is an additional constraint 
of one electron being fixed on the z-axis. That implies that the z-axis Ues on the cone so that the 
cone always cuts the xy plane in two lines, which are orthogonal to each other The orthogonality 
can be verified by imposing z = in Eq. (108) and checking that kik2 — —1. In addition, one 
can find "degenerate" configurations with two pairs of two electrons lying on orthogonal planes 
(Fig. 8). This corresponds to the "opening" of the cone with one of the elliptic radii becoming 
infinite and the resulting node having a form of two orthogonal planes (Fig. 8). The condition 
is equivalent to A44 — iP' ~ a? ~ abc — 0, where A44 is one of the quadratic invariants [96]. 
There are more special cases of nodes with additional restrictions: (a) when two electrons lie on 
a straight line going through the origin; (b) when three electrons lie on a plane going through the 
origin; (c) when four electrons lie on a single plane. 



u*v* + w*u* + w*v* — 0. 



(112) 



{2u^ - - w^)/8 



— u*v* + w*u* + w*v* , 



(113) 
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3.4.2 Hartree-Fock Nodes of the ^S{sp^) Ion 

The HF node for this spin-polarized state with Is^ (He-core) states replaced by pseudopotentials 
can be studied in a similar way as studied previously [92]. We can expand the determinant 
as in the previous cases with a new feature that the radial parts will modify the shapes of the 
hypersurfaces. The system therefore corresponds, e.g., to the N+ ion from Tab. 1. Note that 
due to pseudized core the one-particle s orbital has no radial node although it corresponds to 
the physical 2s state. By expanding the determinant in the colunon of the probe electron with 
position X, y, z the 3D node projection is simply given by 

x + b'y + c'z + d'r]{r) = 0, (1 14) 

where b' ,c' , d! depend on ratios of cofactors and ri{r) = ps (r) / Pp{r) is the ratio of radial parts of 
s and p orbitals. The probe electron will see a plane with a approximately bell-shape deformation 
in the area of the nucleus (see Fig. 9). The shape of deformation depends on the ratio of s and 
p radial parts and the magnitudes and signs of the cofactors. For certain configurations the 
deformation in a detached, separate ellipsoid-like bubble. The bubble is caused by the radial 
dependence of 77 (r) which for pseudized core is not a monotonic function and therefore can 
create new topologies. Note that despite the fact that the 3D projection shows a separated region 
of space (the bubble) the complete node which is 11 -dimensional has again the minimal number 
of two nodal cells. This can be checked by placing the four particles at the vertices of a regular 
tetrahedron with the center at the ionic center The wave function does not vanish for such 
configuration as can be easily verified. Rotations by 7r/3 around of each of the four three-fold 
symmetry axes of the T4 group then shows that all the particles are connected. 

3.4.3 Hartree-Fock Nodes of Spin-Polarized p^d^ and sp^d^ Shells with S Symmetry 

Let us for a moment assume a model wave function in which the radial parts of s,p, d orbitals 
are identical. Then, using the arrangements similar to d^ case, we can expand the determinant of 
p'^d^ in one column and for the 3D node projection we then get 

2u'^ - v"^ - w"^ + au + I3v + 'yw = 0, (115) 

where u, v, w are appropriate hnear combinations of x, y, z. This can be further rewritten as 

2{u + a/4)2 -{v- /3/2)2 - {w - j/2f + So = 0, (1 16) 

where i5o = + + 7^)/4. It is clear that the quadratic surface is offset from the origin 

(nucleus) by a vector normal to au + f3v + jw = plane. Using the properties of quadratic 
surfaces one finds that for [c)? j(c? +/3^+7^)) < 2/3 the node is a single-sheet hyperboloid 
with the radius \fSQ\ otherwise it has a shape of a double-sheet hyperboloid. The double-sheet 
hyperboloid forms when there is an electron located close to the origin. A special case is a cone 
which corresponds to ((5o = 0). The case of sp^dc' is similar, but with different ^o> which now 
has a contribution from the s-orbital (see Fig. 10). Once we include also the correct radial parts 
of orbitals in the s,p, d channels the coefficients of the quadratic form depend on both cofactors 
and orbital radial functions. The resulting nodal surface is deformed beyond an ideal quadric and 
shows some more complicated structure around the nucleus (see Fig. 1 1) as illustrated on HF 
nodes of the majority spin electrons in Mn+^ ion (note that the Ne-core electrons were eliminated 
by pseudopotentials). 
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Fig. 10. The 3D projection of the angular part of the ^'^ S{sp'^cf') state Hartree-Fock node (with radial parts 
of orbitals identical for all spd orbitals). The projection has a topology of a single-sheet or double-sheet 
hyperboloid. The small cross shows the location of the nucleus while the spheres illustrate the electron 
positions. 




Fig. 11. Projected Hartree-Fock node of S{sp'^d^) of the majority spin valence electrons in Mn"^^ ion. 
The Ne-core electrons are eliminated by pseudopotentials. Note the deformations from the radial parts of 
orbitals, including a small bubble detached from the rest of the surface (the right picture). The green cross 
indicates the nucleus and the red crosses the electron positions. 
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3.5 Changes in Nodal Topologies From Interactions and Spin Correlations 

We conjecture that a non-degenerate ground state of any given symmetry possesses only two 
maximal nodal cells. It was demonstrated for the considered fully spin-polarized systems that 
the corresponding HF wave functions have the desired topology, i.e., two maximal nodal cells. 
On the other hand, for partially spin-polarized and unpolarized systems the corresponding HF 

wave functions of the form 



lead to at least four nodal cells due to the product of two determinants each with two cells. As it 
turned out, this is correct only for special cases such as non-interacting systems with occupations 
in both spin channels and some separable models. In general, such structure is an artefact. It 
appears since two different spins and corresponding antisymmetry requirement on configuration 
subspaces is essentially a special non-uniform symmetry requirement on the wave fiinction. We 
will see below that requirements of additional or special symmetries could lead to the increase of 
nodal counts even in the spin-polarized cases. Interestingly, generic interactions have the effect 
of smoothing out the artificial and redundant nodal structures so that the minimal two nodal cell 
property is restored. This is obvious since it is not clear right away that interactions would drive 
the nodal topology changes, in particular, the changes of this type. We will illustrate this on a 
few examples. 

Let us first analyze the simplest spin-polarized case of three electrons in a central, radi- 
ally symmetric potential. Consider the lowest atomic quartet of S symmetry and even parity is 
^S{ls2s3s). In the non-interacting linnit this state exhibits six nodal cells. The reason is that 
all the single-particle orbitals depend only on radii and therefore the nodes have effectively one- 
dimensional character. We will sketch how interactions change the topology and the number of 
nodal cells decreases to minimal two. In the recent paper [88] we showed this numerically for 
the Coulomb potential. For simplicity, we assume a simpler model of harmonically confined 
electrons while using the same symmetries of states and one-particle orbitals as for the Coulomb 
case. The harmonic oscillator has the advantage that the exponential Gaussian prefactor is the 
same for all states and one-particle orbitals irrespective of the occupied sub-shells. As before, we 
omit the prefactors in all expressions since they are irrelevant for the nodes. The non-interacting 
(or HF) wave function then becomes simply 



since (p2s and are first and second order polynomials in r^, respectively. It is clear that there 
are six nodal domains, because ^hf vanishes whenever rj = rj and there are six permutations 
of ri, r2, rs. Now, we can switch-on the interactions and describe the resulting correlations by 
expanding the wave function in excitations. The excitations into s-orbitals with higher quantum 
numbers do not change the nodal structure and their contributions to the exact wave function 
are actually very small. The dominant excited configuration is ^S{ls,2p, 3p), which captures 
angular correlations in the outer shells pair. We write the correlated, two-configuration wave 
function as 




(117) 



*/fF(ri,r2,r3) = (r| 



(118) 



* = det[(t)u,<f>2s,<t>3s] 

+ ^{det[(f>is, (/)2px, hpx] + {x^y) + {x^ z)} 



(119) 
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where /3 is the expansion coefficient. (This effect can be equivalently formulated also as a Pfaffian 
with triplet pairing). Let us now try to carry out the triple exchange. For this purpose we position 
particles as follows: ri = [0,ra,0], r2 = [0,0,0] and = [rb,0, 0] where Va > H > 0. 
During the exchange the particles first move as follows: 1 — > 2, 2 — > 3 and 3^- [r„ . 0,0]. 
The exchange is then finished by rotating particle 3 by +n/2 around the z— axis to the position 
[0, Ta, 0], completing thus the repositioning 3^1. The particle 1 moves to the origin along the 
y— axis while particles 2, 3 move along the a;— axis while their radii are such that > is 
maintained during the exchange. Clearly, during the exchange the node of non-interacting wave 
function will be crossed twice: when n = and again when n = r^. We will now show 
that for the correlated wave function the exchange can be carried out without crossing the node. 
Using the expressions for the states of harmonic oscillator (f)2px = x and 03^3, = x{r^ — 1) and 
substituting the coordinates as outlined above, one finds that the wave fimction becomes 

^ = (4- rMr^ - rDirl - rf) + ^x^x^] (120) 

We assume that (3 > 0. (In the case /3 < 0, it is straightforward to modify the positions and 
exchange path accordingly: the initial position of the particle 3 should be [—H, 0, 0], it moves to 
[—Ta, 0, 0] and it is then rotated by — 7r/2 around the z-axis.) Typical value of (3 is of the order 
of 10~^ which reflects the fact that the same spins (triplet) correlations are usually very small. 
After a Uttle bit of analysis one finds that this function does not change the sign provided the 
difference between the radii r2 and rs when ri = (r2 + i'3)/2 is sufficiently small. Then the 
following inequality holds 

\r2-r3 + 0[{r3-r2f]\<p, (121) 

which is easy to guarantee with some care when moving the particles along the exchange path. 
The minimum of the radial part appears essentially at the point ri = {r2 + r^) /2 and the value 
is negative. However, the other term keeps the wave function positive providing the negative 
term is not very large. Since the term \r2 — ^3 + 0[(r3 — r2)^] | can be made arbitrarily small by 
adjusting radii r2 , to be sufficiently close to each other one can easily fulfill the inequality. The 
wave function then remains positive along the whole exchange path. The nodes of uncorrelated 
and correlated wave function are plotted in Fig. 12. 

Similar topological change occurs in spin-unpolarized systems. The simplest possible case is 
the Be atom ground state with the electronic state ^S'[l,s^2,s^|. The wave-function can be written 
as a sum of HF and correlation components, i.e., ^' = "S^hf + ci^corrt where a is an expansion 
coefficient. The HF part is given as 

= det^[0i„ </)2,]det^[(/.i„ 02.], (122) 

while the dominant correlating configuration is the 2s^ 2p'^ excitation 

^corr = det^[(l)u, ^2px]<iet^[^is, (t>2px] + {x ^ y) + {x ^ z). (123) 

Clearly, the wave function has the required S symmetry and it is straightforward to show that 
the wave function has only two nodal cells. Consider a plane passing through the origin and 
configure the two spin-up electrons at antipodal points on a circle with radius Va > 0. Place the 
spin-down electrons on the same plane at antipodal points of a circle with different radius, say. 
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Fig. 12. The HF node and correlated node for the ^5'(ls2s3s) central potential state. Note the change in 
topology after the inclusion of the correlation through the dominant additional configuration. The figure on 
the left plots the complete HF node since it depends only on the three radii. The figure on the right shows 
the node only on 3D subspace, there are more nodal "openings" in other dimensions. 



r-f, > and rf, 7^ r^. In this configuration the HF wave function vanishes since particles lie on 
the Hartree-Fock node. In general, however, the correlated wave function does not vanish since 
we have 

* = Cr2i-r43, (124) 

where C is a symmetric non-zero factor. We can now rotate all four particles in the common 
plane by tt and the wave function remains constant. The rotation exchanges the two spin-up 
electrons and the two spin-down electrons simultaneously. This shows that the spin-up and - 
down nodal domains are connected otherwise we would have encountered zeros. Note that doing 
the rotation in one of the spin-channels only, i.e., the two-particle exchange for that spin, would 
necessarily result in hitting the node. The correlation therefore enables the four-particle exchange 
to avoid the node and the example provides an interesting insight into the many-particle concerted 
exchange effects. Similar behavior can be found in homogeneous periodic gas and again can be 
generalized to arbitrary sizes [89]. In the periodic systems one can show that these effects enable 
to wind singlet pairs around the periodic cell without encountering the node providing the path 
is chosen appropriately and also that similar effects is impossible for HF wave functions [89]. 

Multi-particle exchanges are closely related to quantum condensates such as Bose-Einstein 
condensation in bosonic systems or superconductivity in fermionic systems. The interpretation 
of condensates as systems which exhibit macroscopic chains of quantum exchanges goes back 
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to Feynman. In bosonic systems this does not help much and even looks "trivial" since there are 
no nodes to begin with and the emergence of condensates is driven by bosonic correlations. In 
fermionic systems the correct nodal topologies also appear only as a necessary but not sufficient 
condition for the condensation to appear. However, from this point of view it is important that the 
Bardeen-Cooper-Schrieffer wave functions, which will be described later, do exhibit the correct 
nodal topologies as shown elsewhere [88, 89]. 




Fig. 13. The 3D projected nodes for N2 dimer. The HF nodes (left) are bended and distorted due to spin 
correlations into CI nodal surface (right) with just two nodal cells. Sampling of nodes is performed with a 
pair of electrons with opposite spins, which are close to each other; positions of other electrons (crosses) 
are fixed; small spheres indicate ions. 



3.5.1 HF and CI Nodes for N2 Dimer State ^ S+ 

The projected nodal structure of the spin-unpolarized ground state for N2 dimer is shown 
in Fig. 13. The He-core electrons are eliminated by pseudopotentials as in previous calcula- 
tions [93]. The HF wave function with the separation of electrons into independent spin-up and 
spin-down subspaces forms fours nodal cells. In the CI case we can see that the HF nodes have 
been distorted and bended to build up channels, which connect regions of the same sign with 
resulting two maximal nodal cells. 

In order to illustrate the accuracy of the constructed CI nodes we have carried out the fixed- 
node DMC simulations with both HF and CI trial wave functions. For the HF trial wave function 
we have obtained the total energy -19.8395(7) H, which recovers « 94% of correlation energy. 
For the CI trial function with more than 5000 determinants from double excitations into 45 virtual 
orbitals with ccpV6Z basis (with up to / basis functions) the total energy reaches -19.870(5) H, 
recovering thus w 98% of correlation energy. The estimated exact energy is -19.8822 H. It is 
clear that despite the extensive determinantal expansion the fixed-node error is still visible. It is 
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just another demonstration that convergence of such correlated wave functions is slow and more 
sophisticated approaches are needed to reach beyond the limits of the determinantal expansions. 

3.6 Conclusions 

We reviewed some of the recent advances in understanding the general properties of nodes 
of fermionic wave functions. One of the key findings is that the nodal topologies of generic 
fermionic ground states appear to be remarkably simple. For d > 1 the node bisects the configu- 
rations space with the resulting nodal domains being connected. Intuitively, this is not completely 
unexpected since presence of any node raises the wave function curvature, i.e., the kinetic en- 
ergy. Therefore the energy minimization leads to decreasing both the nodal "volume" and also 
the corresponding curvatures. We have also seen that by imposing additional symmetries one can 
increase the nodal domain counts. What is less obvious that even in such cases the interactions 
tend to smooth out the nodal surfaces and fuse the multiple domains into the minimal number. 
This happens for both spin-polarized and unpolarized systems as we have shown on few-particle 
examples with correlated wave functions. 

Comparison of bosonic and fermionic ground states from the viewpoint of nodal topologies 
reveals an interesting perspective. It is well-known that generic bosonic ground states are node- 
less and so that one can say that they represent "global S-waves". On the other hand, the bisection 
of the configuration space suggests that generic fermionic ground states are "global P-waves". 
This is a remarkably beautiful result of the interplay between continuous space and symmetries: 
it shows that Nature has chosen the simplest topologies to fulfill the fundamental symmetries. 

Furthermore, analysis of QMC data also reveals that much of the correlation in fermionic 
systems has actually bosonic nature, i.e., particle correlations within the nodal domains such 
two-electron cusps, multi-particle even parity exchanges, etc. This gives a rather prominent role 
to the fixed-node DMC method which recovers such bosonic correlations exactly. Comparison 
of fixed-node results with estimations of exact total energies (from other methods for small num- 
ber of electrons and from experiments for extended systems) show that this is indeed the case. 
Calculations of atoms, molecules, clusters, solids, quantum liquids, etc, shows that the amount 
of recovered correlation is similar essentially in all cases and reaches 90-95% of the total corre- 
lation energy irrespective of the type of the system. This is very high when compared with other 
approaches based on explicitly correlated wave functions. In fact, for extended systems, there 
is no other method which can achieve such accuracy. Indeed, this is a significant outcome of 
the QMC efforts over the years and it is also one of the key reasons why these approaches are 
becoming more common. The achieved level of accuracy now enables to predict many "robust" 
quantities, i.e., which are larger than 0.1-0.2 eV, such as cohesions, optical gaps, etc. 

On the other hand, the effects which are beyond the fixed-node approximation still leave 
many important and exciting problems unsolved or unreachable. Subtle effects such as magnetic 
ordering, presence of new quantum phases, happen at scales of tens of meV or smaller and, con- 
sequently, can be significantly affected by the fixed-node bias. A limited understanding of many 
of these aspects is still the main reason why the progress has been very gradual albeit steady over 
the years. Although the origin of these errors is exclusively in inaccuracies of the nodal shapes at 
present there are no clear-cut solutions which would enable to describe these in an efficient and 
general manner. Also how much and which types of nodal errors can influence interesting quan- 
tum phenomena is not yet understood. One possible path forward is to get better handle of the 
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nodal variational freedom using more effective nodal representations or new constructions which 
will address this particular challenge. It seems that this is out of reach of traditional approaches: 
common correlated wave function constructions, such as expansion in excited determinants, ap- 
pear to be very slowly converging and are therefore inefficient, limiting the applications to small 
sizes. Such constructions are essentially non-existent for extended systems such as solids or 
surfaces. Some of the recently explored ideas in this direction will be presented below. Another 
possibility to overcome this fundamental obstacle might come from better use of statistical meth- 
ods to deal with the nodal inaccuracies. In this respect, we mentioned some approaches in the 
previous section and there are ongoing efforts by us and others in exploring ideas in this research 
area as well. Finally, "brute force" approaches, such as release node methods, are not necessarily 
useless and could be useful for benchmarks in smaller and intermediate size systems. 
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4 Pfaffian Pairing Wave Functions 



4.1 Introduction 



As we mentioned, commonly used trial wave functions are based on one-particle orbitals and 
Slater-Jastrow forms. In order to overcome this one-particle based constructions, pair orbitals 
have been previously tried in various frameworks. In condensed systems one such example 
is the Bardeen-Cooper-Schrieffer (BCS) wave function, which is an antisymmetrized product 
of singlet pairs. In quantum chemistry, the pair orbital is sometimes referred to as geminal 
and the resulting wave function as the antisymmetrized geminal product (AGP). It has been 
recently used to calculate several atoms and molecules as well as superfluid ultracold Fermi 
gases [85, 86, 97]. The results show promising gains, nevertheless, for partially spin-polarized 
systems the improvements are less systematic due to the lack of pair correlations in the spin- 
polarized subspace [85, 86]. The spin-polarized (triplet) pairing wave functions naturally lead to 
Pfaffians, which have been previously applied to model systems [98-100]. 

In this section, we further develop ideas from Sec. 2.4.2, in which we have introduced the 
Pfaffian form. Sec. 4.2 presents mathematical identities and expressions associated with Pfaffi- 
ans, some of them derived for the first time. In Sec. 4.3 we establish relationships between the 
Pfaffian, BCS and HF wave functions. These forms are tested on atomic and molecular systems 
in variational and fixed-node DMC OA methods as described in Sec. 4.4.1. In Sec. 4.4.2 we 
investigate generaUzations to linear combinations of Pfaffians and compare the results with other 
methods in regard to to wave functions compactness, the wave functions. In Sec. 4.4.3 we ana- 
lyze the topological differences between HF, Pfaffian and an essentially exact wave functions for 
a given test example. 



First introduced by Cayley in 1852 [101], the Pfaffian is named after German mathematician 
Johann Friedrich Pfaff. Given a 2n x 2n skew- symmetric matrix A = [aij], the Pfaffian of A is 
defined as antisymmetrized product 



pf[A] = A[ai^2a3,4, ■ ■ ■ a2n-l,2n] 



4.2 Algebra of Pfaffians 



Here we will closely follow our exposition of Pfaffian algebra from Ref. [2]. 



4.2.1 Definitions 




(125) 



a 



where the sum runs over all possible (2n— 1)!! pair partitions a = {{ii, ji), {i2, j2), ■ ■ ■ , (imjn))} 
of {1,2,..., 2n} with ik < jk- The sign of permutation associated with the partition a is de- 
noted as sgn(Q!). The Pfaffian for a matrix of odd order equals to zero. The following example 
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gives Pfaffian of a ^4(4 x 4) skew-symmetric matrix 








ai2 


Ol3 


ai4 


pf 


-ai2 





^23 


0-24 


-ai3 


-a23 





034 




-ai4 


-a24 


-(134 






= CH2<l34 — Cfl3<l24 + <ll4(l23- 



It can be also evaluated recursively as 

2n 

[^] = J2 sgn(aij) Oi, 

2n 

= Yai,jPc{ai,j), 

3=2 



(126) 



(127) 



where aij is partition with ik,jk 7^ 1) j and Pc(«i,j) is defined as Pfaffian cofactor of aij. The 
cofactor for an element Uj^k is given by a formula, 



Pc(a„fc) = (-ir+'=+Vf[^(.7,fc;j,fc)] 



(128) 



where the matrix A{j, k;j, k) has the rank 2(n — 1) x 2(n — 1) and is obtained from A by 
eliminating j and fc rows and colunms. 



4.2.2 Calculation of a Pfaffian 

There exist several identities involving Pfaffians and determinants. For any 2n x 2n skew- 
symmetric matrix A and arbitrary matrices B(2n x 2n) and M{n x n) we have the following 
relations: 

Ai " 

^2 

M 




pf 



pf 



(-l)>f[A] 


(129a) 


dct[^] 


(129b) 


pf[Ai]pf[A2] 


(129c) 


det[-B]pf[A] 


(129d) 


(-1) '2 *det[M]. 


(129e) 



Let us comment on these identities and point out to respective proofs as appropriate: 

(129a) Each permutation contains product of n pairs resulting in an overall (—1)" factor. 

(129b) This is a well-known Cayley's relationship between the Pfaffian and the determinant of a 
skew-symmetric matrix. This is a well-known relationship which has been proved many 
times in variety of ways [102-104]. 



(129c) Use the expansion by Pfaffian cofactors. 
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(129d) By squaring (4d), using Eq. (129b), and taking the square root one finds pi[BAB^] = 
±det[i?]pf [A]. Substituting the identity matrix / for B one finds + to be the correct sign. 

(129e) Assume 



in Eq. (129d). The overall sign is given by value of pf [A]. 

The identities Usted above imply several important properties. First, Eqs. (129d) and (129e) 
show that every determinant can be written as a Pfaffian, but on the contrary, only the absolute 
value of Pfaffian can be given by determinant [Eq. (129b)]. The Pfaffian is therefore a generalized 
form of the determinant. Second, by substituting suitable matrices [105] for M in Eq. (129d) one 
can verify the following three properties of Pfaffians [106], similar to the well-known properties 
of determinants. 

(a) MultipUcation of a row and a column by a constant is equivalent to multiplication of the 
Pfaffian by the same constant. 

(b) Simultaneous interchange of two different rows and corresponding columns changes the 
Pfaffian sign. 

(c) A multiple of a row and a corresponding column added to another row and corresponding 
column leaves the Pfaffian value unchanged. 

It is also clear that any skew-symmetric matrix can be brought to the block-diagonal form by an 
orthogonal transformation. Recursive evaluation [Eq. (127)] implies that the Pfaffian of block- 
diagonal matrix is directly given by 




and 




■ Ai 
-Ai 







pf 



As 
-A2 



— A1A2 . . . A„. 



(130) 









-A. 



A. 







Therefore by employing a simple Gaussian elimination technique with row pivoting (see Ref. [107]) 
we can transform any skew-symmetric matrix into block-diagonal form and obtain its Pfaffian 
value in O(n^) time. 
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However, in QMC applications, one often needs to evaluate the wave function after a single 
electron update. Since Cayley [104] showed (for the proof see [2]) that 



det 
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023 
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we can relate the Pfaffian of original matrix pf[A] to the Pfaffian of a matrix with updated first 
row and colunon pf [-B] using the inverse matrix in only 0{n) operations by 



pf[B] 



det[A] E,- bijA 



pf[A] 



(132) 



The second part of Eq. (132) was obtained by taking advantage of the identity in Eq. (129b). 
Slightly more compUcated relation between pf [A] and pf [-B] can be derived if one considers 
simultaneous change of two separate rows and colunms, which represents the two electron update 
of a wave function. 



4,2,3 Gradient and Hessian of Pfaffian 

In optimization routines, it is often favorable to calculate the gradient and Hessian of wave func- 
tion with respect to its variational parameters. Given the matrix elements A dependent on a set 
of parameters {c}, one can derive the following useful relations: 



and 
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(134) 



where A ^ is again the inverse of A. 
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4.3 Pairing Wave Functions 

Let us now consider the generalization of the one-particle orbital to a two-particle (or pair) orbital 
<p{i,j), where tilde again denotes dependence on both spatial and spin variables. The simplest 
antisymmetric wave function for 2A'' electrons constructed from the pair orbital is a Pfaffian 

* = ^[^(1, 2), 0(3, 4) . . . 0(27V - 1, 2N)] = pf[0(i,.7)]. (135) 

The antisymmetry is guaranteed by the definition (125), since the signs of pair partitions alter- 
nate depending on the parity of the corresponding permutation. The important difference from 
Slater determinant is that in the simplest case only one pair orbital is necessary. (This can be 
generahzed, of course, as will be shown later.) If we further restrict our description to systems 
with coUinear spins, the pair orbital (^(r^, (7^; rj,aj) for two electrons in positions rj and and 
with spins projections cr, and aj can be expressed as 

4>{r„a,;r,,aj) = 0(z, j)(a,a,|[| U) - I it)]/V2 (136) 

+x^Hi,j)M[\n) + \m/V2 

Here j) = (/'(r^, r^ ) is even while x^^, X^^ and x"^"^ are odd functions of spatial coordinates. 
In the rest of this section we will discuss special cases of the wave function (135). 



4.3.1 Singlet Pairing Wave Function 

Let us consider the first 1, 2, iV electrons to be spin-up and the rest N + 1, 2N electrons 
to be spin-down and allow only ^(rj, r^) in ^(rj, cTj; r^, aj) to be non-zero. Using the Pfaffian 
identity [Eq. (129e)], we can write the wave function for singlet pairs, also known as the 
Bardeen-Cooper-Schrieffer (BCS) wave function (or AGP), in the following form 

= det [*''"■'-], (137) 

which is simply a determinant of the N x N matrix ^'^^ = [<p{i,j)] as was shown previ- 
ously [108, 109]. 

Let us comment on the BCS wave function which has its origin on the theory of supercon- 
ductivity [110]. The original form is actually for variable number of particles, since the theory 
is formulated in the grand canonical ensemble — the number of particles adjusts to the chemical 
potential. Hence, if the grand canonical form is projected onto the state with fixed number of 
particles, its form is then given by Eq. (137). 

It is straightforward to show that the BCS wave function contains the restricted HF wave 
function as a special case. Let us define the Slater matrix C = [fiiJ)] where {ifi} is a set of HF 
occupied orbitals. Then we can write 

= det[C]det[C] = det[CC'^] = det[*^^^], (138) 
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where 

JV 

= <pHF{i,j) = ^<Pk(i)'Pk{j)- (139) 

k=l 

On the other hand, we can think of the BCS wave function as a natural generahzation of the HF 
one. To do so we write the singlet pair orbital as 

M>N 

ct>{i,j) = Sk,iMi}Vi{j) = <P{i}S<p{j), (140) 

k,l 

where the sum runs over all M (occupied and virtual) single-particle orbitals and S is some 
symmetric matrix. Therefore, we can define one-particle orbitals, which diagonaUze this matrix 
and call them natural orbitals of a singlet pair. 

The BCS wave function is efficient for describing systems with single-band correlations such 
as Cooper pairs in conventional BCS superconductors, where pairs form from one-particle states 
close to the Fermi level. 



4.3.2 Triplet Pairing Wave Function 

Let us assume, in our system of 2N electrons, that the first Mi electrons are spin-up and re- 
maining M2 = 2N — Ml electrons are spin-down. Further, we restrict Mi and M2 to be even 
numbers. Then by allowing only x^^(^) j) ^nd x^^ihj) (136) to be non-zero, we obtain from 
expression (135) by the use of Eq. (129c) 

r^^^ 



*TP = pf 



^ 



.4.4. 



pfK^ipf[ei, (141) 



where we have introduced Mi x Mi(Af2 x M2) matrices ^'^^^^^^ = [x^'^^^^\i, j)] ■ This result 
was previously found in a weaker form in Ref. [108, 109]. 

The connection to a restricted HF wave function for the above state can be again estabUshed 
as follows. In accord with what we defined above, dct[(C)'''(^'] are spin-up(-down) Slater deter- 
minants of some HF orbitals {(pi}. Then, by taking advantage of Eq. (129e) we can write 

= det[C''']det[C^] (142) 

_ pf[CUiC'''^]pf[C^A2C^^] 
~ pi[Ai]pi[A2] ' 

given Ai and A2 are some skew- symmetric non-singular matrices. In the simplest case, when 
Ai and A2 have block-diagonal form (130) with all values A, = 1, one gets 

*HF=pfK|,V]pf[dF]- (143) 
The pair orbitals can be then expressed as 

i^HF%,J=X^}l^''\i,j) (144) 

Mi(M2)/2 

= XI i'P2k-li^)f2k{j) - 'P2k-l{j)f2kii))- 
fe=l 
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Similarly to the singlet pairing case, the triplet pairing wave function appears as a natural gener- 
alization of the HF one. We can write the triplet pair orbitals as 

M>Mi(M2) 
k,l 

= V?(j)A^^(^^)¥>(j), (145) 

where again the sum runs over all M (occupied and virtual) single-particle orbitals and A^^^"^"^^ 
are some skew- symmetric matrices. Therefore we can define one-particle orbitals which block- 
diagonalize these matrices and call them natural orbitals of a triplet spin-up-up (down-down) 
pair. 



4.3.3 Generalized Pairing Wave Function 



Let us now consider a partially spin-polarized system with unpaired electrons. In order to in- 
troduce both types of pairing we allow x^^ih.l)' X.^^{hj) in (136) to be non-zero. 
However, we omit the x^^ihj) term. Then our usual ordered choice of electrons labels with all 
spin-up electrons first and remaining electrons spin-down enables us to directly write from (135) 
the singlet-triplet-impaired (STU) orbital Pfaffian wave function [1] 



^STU = Pf 



l-(p 



-f 



.IT 







(146) 



where the bold symbols are block matrices or vectors of corresponding orbitals as defined in 
Sections 4.3.1 and 4.3.2 and T denotes transposition. For a spin-restricted STU wave fimction 
the pair and one-particle orbitals of spin-up and -down channels would be identical. 

The Pfaffian form can accommodate both singlet and triplet pairs as well as one-particle 
unpaired orbitals into a single, compact wave function. The correspondence of STU, Pfaffian 
wave function to HF wave function can be estabUshed in a similar way for the pure singlet and 
triplet pairing cases. 

It is instructive to write down some illustrative examples of the introduced form for a few- 
electron atomic states. For example, for the Li atom with ^S'(ls'^2s) we can write 



*STi7 = pf 



e^t(i,2) </.n(i,3) ^T^(i) 

^^^2,1) </.t;(2,3) ^1,(2) 

cj>n(3,l) 0t4(3,2) cpl{3) 

-V'L(l) -VL(2) -<^i«(3) 



= e^ni,2)^t(3) - ^^^(1,3)(pL(2) +0^^(2,3)^L(1)- 



(147) 



It is easy to check that the expression has the correct antisyrmnetry. Also note that on can get the 
HF wave function by substitution 



(148) 
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or, alternatively, by setting 



(149) 



For the valence part of the carbon atom (we skip the He-core), we can write the state as ^P(2s^2p^) 
with the corresponding wave function 



^STU = Pf 



eHh2) eHh^) 0^^(i,4y 

eH2,l) Ctt(2,3) </,n(2,4) 

eH3,l) eHS,2) #H3,4) 

-#^(4,1) -#^(4,2) -(l)^^{4,3) 



(150) 



Finally, for the nitrogen atom valence electrons the quartet state given by ^5(2s^2p^) and the 
wave function can be written as 



STU 



pf 








^tT(i,3) 


^^(1,4) 


</.t^(l,5) 




eH2,i) 





^tT(2,3) 


^tT(2,4) 


</.t^(2,5) 


^H2) 




eHi,2) 





^^^3,4) 


</>n(3,5) 






eH^,2) 


e^^(4,3) 





(/>^^(4, 5) 


^t(4) 




-0ti(5,2) 


-0^^(5,3) 















-<^t(4) 








(151) 



4.4 Applications of Pairing Wave Functions in QMC 

In the remainder of this section, we would like to illustrate the applications of Pfaffian paring 
wave functions in QMC methods. As we have mentioned before in Sec. 2.4.2, the trial wave 
function used in QMC calculations by variational and fixed-node diffusion Monte Carlo is a 
product of an antisymmetric part times the Jastrow correlation factor: 

*t(R) = *a(R) eMUcorriinj}, {ni}, {rji})], (152) 

where Ucorr depends on electron-electron, electron-ion and, possibly, on electron-electron-ion 
combinations of distances. The antisymmetric part can be represented as '^a = '^hf and '9 a = 
^STU or 'i>A = ^Bcs- Further, the pair orbitals xihj) ^ind j) are expanded in products of a 
one-particle orbital basis [85] according to Eqs. (140) and (145). This approach was particularly 
convenient due to availability of one-particle atomic and molecular orbitals from either Hartree- 
Fock or CI correlated calculations. However, expansion of this type might not be the most 
optimal for homogeneous or extended periodic systems and other forms were also successfully 
employed [97]. For the applications below, we typically used about 10 virtual orbitals. The 
natural orbitals (see e.g. Ref. [22]) produced better and more systematic results than the HF 
ones. The pair orbital expansion coefficients were then optimized in VMC by minimizations 
of energy, variance or a combination of energy and variance (for details, see Sec. 7.2). The 
optimization procedure required the calculation of gradient and the Hessian of the wave function 
according to Eqs. (133) and (134). 
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4.4.1 Single Pfaffian Calculations 

The performance of single Pfaffian pairing wave functions in the STU form [Eq. (146)] was 
tested on several first row atoms and dimers [1]. The pseudopotentials [93, 111] were used to 
eliminate the atomic core electrons, except in Be atom case. 

The most important result is that STU wave functions systematically achieve higher percent- 
age of recovered correlation energy than single determinant wave functions in the fixed-node 
DMC method (see Table 2 and Fig. 14). Another significant result is that in general the triplet 
contribution for these single Pfaffian STU wave functions are small, perhaps the only exception 
being the nitrogen atom, where we see a gain of additional 1 % in correlation energy when com- 
pared to a trial wave function without triplet pairs. We believe, this is due to the fact, that ground 
state of nitrogen atom has a quartet spin state and therefore the highest spin polarization from 
all studied cases. Overall, the single Pfaffian form is capable of capturing near-degeneracies and 
mixing of excited states for both spin-polarized and unpolarized systems. 




System 



Fig. 14. Correlation energies obtained by QMC methods with the different trial wave functions: VMC and 
fixed-node DMC with HF nodes (HF) and STU Pfaffian nodes (PF). The lower plot shows the fixed-node 
DMC correlation energy gains over HF nodes for BCS and STU Pfaffian wave functions. The statistical 
error bars are of the symbol sizes or smaller. Except for the Be atom all the calculations used the same 
pseudopotentials [93, ill]. 
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Tab. 2. Total energies for C, N and O atoms and their dimers with amounts of the correlation energy recov- 
ered in VMC and DMC methods with wave functions as discussed in the text. Unless noted otherwise, the 
numbers in parentheses are the statistical errors in the last digit from corresponding QMC calculation. Ener- 
gies are in Hartree atomic units. For C, N, O atoms wc used the correlation energies by Dolg [1 12](0.1031, 
0.1303, 0.1937 H). For the estimation of correlation energies of dimers we needed accurate HF energies at 
experimental distances [113] and the estimated exact total energies. Each exact total energy was estimated 
as a sum of total energies of constituent atoms minus experimental binding energy [113-115] adjusted for 
experimental zero-point energy [115]. 



Melhod/WF 


C 




N 







E,,„,,,|%] 


HF 


-5.31471 





-9.62892 





-15.65851 





VMC/HF 


-5.3939(4) 


76.8(4) 


-9.7375(1) 


83.3(1) 


-15.8210(6) 


83.9(3) 


VMC/BCS 


-5.4061(2) 


88.6(2) 


-9.7427(3) 


87.3(2) 


-15.8250(3) 


86.0(2) 


VMC/STU 


-5.4068(2) 


89.3(2) 


-9.7433(1) 


87.8(1) 


-15.8255(3) 


86.2(2) 


DMC/HF 


-5.4061(3) 


88.6(2) 


-9.7496(2) 


92.6(2) 


-15.8421(2) 


94.8(1) 


DMC/BCS 


-5.4140(2) 


96.3(2) 


-9.7536(2) 


95.7(2) 


-15.8439(4) 


95.7(2) 


DMC/STU 


-5.4139(2) 


96.2(2) 


-9.7551(2) 


96.8(1) 


-15.8433(3) 


95.4(2) 


Est./Exact 


-5.417806 


100 


-9.759215 


100 


-15.85216 


100 


Method/WF 


C2 




N2 




O2 




HF 


-10.6604 





-19.4504 





-31.3580 





VIVIC/HF 


-10.9579(4) 


72.9(1) 


-19.7958(5) 


80.0(1) 


-31.7858(6) 


79.6(1) 


VIVIC/BCS 


-11.0059(4) 


84.7(1) 


-19.8179(6) 


85.0(1) 


-31.8237(4) 


86.7(1) 


VIVIC/STU 


-11.0062(3) 


84.8(1) 


-19.821(1) 


85.8(2) 


-31.8234(4) 


86.6(1) 


DIVIC/HF 


-11.0153(4) 


87.0(1) 


-19.8521(3) 


93.0(1) 


-31.8649(5) 


94.3(1) 


DMC/BCS 


-11.0416(3) 


93.5(1) 


-19.8605(6) 


94.9(1) 


-31.8664(5) 


94.6(1) 


DMC/STU 


-11.0421(5) 


93.6(1) 


-19.8607(4) 


95.0(1) 


-31.8654(5) 


94.4(1) 


Est./Exact'^ 


-11.068(5)^ 


100.0(10) 


-19.8825(6)'' 


100.0(1) 


-31.8954(1)'' 


100.0(1) 



"There is rather large discrepancy in the experimental values of C2 binding energy (141.8(9) [113], 143(3) [115] 
and 145.2(5) kcal/mol [1 14]). For the estimation of exact energy we have taken the average value of 143(3) kcal/mol. 
''Experimental binding energies taken from ref. [113]. 
"^The error bars on estimated exact total energies are due to experiment. 



4.4.2 IMulti-Pfafflan Calculations 

In order to capture tlie correlation energy missing from the single STU Pfaffian wave fimction 
and to test the limits of the Pfaffian functional form, we have also proposed to expand the in 
the linear combination of STU Pfaffian wave functions [1,2]. 

Following the approach adopted from the CI correlated calculations, accurate can be 
expressed as a linear combination of reference state ^0 and single ^^ and double^f^j excitations: 

N M N M 

^ciSD = co*o + E E + E E (153) 

i k ij kl 

where i-lh and j-th electrons are being excited into k and I virtual orbitals. Note that the the 
number of determinants in the expansion will be of the order of x M^. Analogously, we 
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Tab. 3. Percentages of correlation energies recovered for C, N and O atoms by VMC and DMC methods 
with wave functions as discussed in the text. The corresponding number of Pfaffians or determinants n for 
each wave function is also shown. For details, see caption of Table 2. 



MethodAVF 


n 


C 


n 


N 


n 


O 


VMC/MPF 


3 


92.3(1) 


5 


90.6(1) 


11 


92.6(3) 


VMC/CP 


98 


89.7(4) 


85 


91.9(2) 


136 


89.7(4) 


DMC/MPF 


3 


98.9(2) 


5 


98.4(1) 


11 


97.2(1) 


DMC/CP 


98 


99.3(3) 


85 


98.9(2) 


136 


98.4(2) 



"The determinantal weights were taken directly from CI calculation without re-optimization in VMC. 
postulate the multi-Pfaffian (MPF) wave function as a linear combination of STU Pfaffians: 

N N 
i ij 

where each cpij is the generalized paring orbital [Eq. (136)] containing aU possible exci- 
tations of i and j electrons. The resulting wave function will in general consist of only N'^ 
Pfaffians. Further, if the reference state is the most dominant state (i.e., cq '> c^j), it is 
possible to show by expanding cpij into orders of cfj/co that 

*MPF = ^ciSD + O {(c'lj/cof) . (155) 

In fact, the only difference between the and \1/c/sd wave functions is in presence of the 

higher order excitations, which are approximately present in the V^mpf- 

The mapping of the MPF wave functions onto equivalent CISD wave functions was also 
verified numerically in the variational and FN-DMC methods using the above first row atoms 
and molecules [1, 2]. The results in Table 3 show that for the atomic systems the MPF wave 
functions are able to recover close to 99% of correlation energy — very similarly to CISD wave 
functions, while requiring order less terms. The results for diatomic cases (see Table 4) show 
very similar behavior. The correlation energy recovered is on the order 98% with MPF wave 
functions closely matching the CISD wave functions — despite much richer electronic structure 
than in atomic cases. The comparison with the CI results therefore demonstrates that it is possible 
to obtain similar quality wave functions with corresponding improvements of the fermion nodes 
at much smaller calculational cost. 

4.4.3 Nodal Properties 

As we have already extensively discussed in Sec. 3, the fermion node is a complicated hyper- 
surface defined by an implicit equation '^a{R) — 0. Due to its complexity, we approximate 
the exact fermion node with the node of the best available trial wave function. However, any 
deviations from the exact node lead to the fixed-node DMC errors. It is therefore quite valuable 
to compare the nodes of a different accuracy. 
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Tab. 4. Total energies for C2 and N2 dimers with amounts of correlation energy recovered in VMC and 
DMC methods with wave functions as discussed in the text. Energies are in Hartree atomic units. The 
corresponding number of Pfaffians or determinants n for each wave function is also shown. For details, see 
caption of Table 2. 



Method/WF 


n 


C2 




n 


N2 


^corr l*^] 


VMC/MPF'= 


11 


-11.0402(1) 


93.1(1) 


16 


-19.8413(6) 


90.5(1) 


VMC/CP 


404 


-11.0409(3) 


93.3(1) 


535 


-19.8487(6) 


92.2(1) 


DMC/MPF'= 


11 


-11.0574(5) 


97.3(1) 


16 


-19.8670(8) 


96.4(2) 


DMC/CP 


404 


-11.0593(6)'' 


97.8(2) 


535 


-19.8713(5) 


97.4(1) 



"The determinantal weights were re-optimized in the VMC method. 

''Recently, Umrigar et.al. [84] published very accurate DMC result for fully optimized CI wave function with up to 
500 determinants for C2 molecule. The resulting well-depth of his calculation is 6.33(1) eV, which is only 0.03 eV 
form estimated exact value of Ref. [1 16]. The well-depth resulting from our DMC/CI energy of —11.0593(6) H equals 
to 6.08(3) eV. 

'^one type of triplet-like excitation not included. 




Fig. 15. A three-dimensional cut through the fermion node hypersurface of oxygen atom obtained by 
scanning the wave function with a spin-up and -down (singlet) pair of electrons at the equal positions, 
while keeping the rest of electrons at a given VMC snapshot positions (small green spheres). Nucleus is 
depicted in the center of the cube by the blue sphere. The three colors (from left to right) show nodes of: 
Hartree-Fock (red/dark gray); Multi-Pfaffian nodes (orange/medium gray); and the nodes of the CI wave 
function (yellow/light gray). The CI nodal surface is very close to the exact one (see text). The HF node 
clearly divides the space into four nodal cells while Pfaffian and CI wave functions partitioning leads to the 
minimal number of two nodal cells. The changes in the nodal topology occur on the appreciable spatial 
scale of the order of I a.u. 

The Fig. 15 shows the example of nodal structure of oxygen atom (Is^ electrons were re- 
moved by pseudopotential). Here we compare the nodal surfaces of HF (no pairing), MPF Pfaf- 
fian (STU pairing) and a high accuracy CI wave function with more than 3000 determinants, 
which gives essentially exact fermion nodes (i.e., 99.8(3)% of correlation energy in fixed-node 
DMC). From the figure, it is clear that the changes in the nodal surfaces are significant — the most 
important one being the elimination of artificial four nodal cells resulting from the independence 
of spin-up and -down channels in HF. The Pfaffian smooths-out the crossings and fuses the com- 
partments of the same sign into the single ones. These topology changes therefore lead to the 
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minimal number of two nodal cells (for details see Sec. 3). However, the MPF Pfaffian nodes 
are noticeably different close to nucleus and in the tails and the amount of missing correlation 
energy is still non-negligible. 

4.5 Conclusions 

The proposed Pfaffians with singlet and triplet pairs together with unpaired orbitals provides a 
variationally rich and compact wave functions. They offer significant and systematic improve- 
ments over commonly used Slater determinant-based wave functions and can be calculated as 
easily as determinants using the presented formulas. We have also shown relationships between 
HF, BCS(AGP) and Pfaffian wave functions. Clearly, Pfaffian pairing wave functions are able 
to capture a large fraction of missing correlation energy with consistent treatment of both spin- 
polarized and unpolarized pairs. We have explored Pfaffian expansions which enabled us to 
capture additional correlation and map them onto the equivalent CISD wave functions. Finally, 
the gains in correlation energy for Pfaffians come from improved fermion nodes, which are very 
close to the exact ones and exhibit the correct topology with the minimal number of two nodal 
cells. 
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5 Backfiow Correlations in Slater and Pfafflan Wave Functions 
5.1 Introduction 

Another approach for improvement of the nodal accuracy of variational trial wave functions is 

to employ backflow correlations. They were first introduced by Feynman and Cohen [57] as a 
simple ansatz to describe correlations and excitations in liquid ^He. Since then, a number of 
authors [55, 58-62] showed that the backflow correlations are useful also in fermionic systems. 
Recently, the backflow transformation of electron coordinates has been applied to chemical (e.i., 
inhomogeneous) systems [63, 64, 117, 118]. This section contains detailed description of in- 
homogeneous backflow transformation for multi-determinantal and Pfafflan wave functions and 
their applications. 

Let us try to briefly elucidate where the motivation for the backflow comes from. Con- 
sider the trial function of the famiUar form which we used a number of times before 'S't(R) = 
det[...]e^='"''', and let us apply the projector exp(— t'H) as given by 

exp(-r^)*T(R) = *t(R) - tH^t{B,) + 0{t'^) (156) 
In order to analyze the flrst order term we re-arrange the terms in the Hamiltonian 

n = T + Ve^t + Vee = T+ (Kc^t + Vmf) + (Ke - Kn/), (157) 

where Vmf is a one-particle mean-field approximation for Vee which corresponds to the theory 
used for generation of orbitals. A closer look on the first order term then leads to the following 

:«det[...]e^— =e^--(T + Kxt + Kn/)det[...] + det[...](T + - Vmf)e^^°^^ 

_ Ve^-"- -Vdet [...]. (158) 

Note that the first two terms should be close to ^t(^)Eioc{'R-), where E[oc(R-) is some function 
which is not very far from a constant. The reason is that in the first term the determinant solves 
the corresponding one-particle Hamiltonian, while in the second term the correlation is optimized 
to cancel out the difference Vee ~ Vmf - The third term appears to be "spurious" since it is not 
compensated by any potential. It has a form of the dot product of two "fluxes", one resulting 
from Vext + Vmf while the other from the electron-electron interactions. These terms need to be 
"cancelled" out what requires modifying the wave function or including additional terms. Which 
types of terms would be effective? We consider two limiting cases. The first one corresponds to 

|Vdet[...]| > |Ve^— I (159) 

with that the "flux" from the determinant having much larger amplitude so that the system is 
strongly inhomogeneous — ^typical behaviour for chemical systems and solid state materials. For 
these cases the excited determinants, Pfafflans, etc, appear as the most obvious, if not the most 
efflcient, way to cancel out the spurious component. On the other hand, if the inequahty goes the 
other way 



|Vdet[...]| < |Ve^'=""-'- 



(160) 
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then the system has density close to a constant or only mildly varying so that the electron-electron 
term dominates. This is typical for homogeneous electron gas, quantum liquids such ^He and 
^He, and similar systems. Feynman and Cohen showed that in this case the backflow coordinates 
can partially compensate for the spurious term, basically by sHghtly displacing a given particle 
to reflect the influence of particles which happen to be nearby. 



5.2 Backflow Wave Function Form 

As already discussed in Sec. 2.4.2, our trial wave function has the form 

*t(R) = *a(X) X exp[C/eo^^(R)], (161) 

where X = (xi, . . . , x^v) represents some quasi-particle coordinates dependent on all N elec- 
tron positions R. Further, 'f^ is either (multi)-determinantal or Pfaffian wave function and Ucorr 
is the Jastrow correlation factor both defined in the previous sections. 
The quasi-coordinate of ith electron at position is given as 

Xi = ri+^i(R) 

= r, + ^r(R) + C(R) + C"(R), (162) 

where is the ith electron's backflow displacement divided to the contributions from one-body 
(electron-nucleus), two-body (electron-electron) and three-body (electron-electron-nucleus) terms. 
They can be further expressed as 

^r{R) = J2uirij)rij (164) 



Cr"(R) = ^^[wi{rij,rii,rji)rij + W2{rij,rii,rji)rii], (165) 

where r^j = rj — and rn = rj — r/. The x. u and wi with 11)2 terms are similar to one, two 
and three-body Jastrow terms and are further expanded as 

X{r) = '^Ckak{r), (166) 

k 

u{r) =Y,dkbk{r), (167) 

k 

'Wi,2{rij,rii,rji) = ^ gkimak{rii)ai{rji)bm{rij). (168) 

klm 

The one dimensional basis functions {a} and {b} are chosen as Gaussians with the center in 
origin or as polynomial Fade functions (Sec. 2.4.3) to preserve the electron-electron [Eqs. (49) 
and (50)] and electron-nucleus [Eq. (48)] cusp conditions. The set of variational parameters 
{c}, {d} and {g} is minimized with respect to mixture of energy and variance (for details, see 
Sec. 7.2). In addition, all electron-electron coefficients {{dk} and {gkim} with fixed k and are 
allowed to be different for spin-like and for spin-unlike electron pairs. 
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5.3 Applications of Backflow Correlated Wave Functions 

In the remainder of this section we present VMC and DMC results obtained with above imple- 
mentation of backflow correlations for determinant and Pfaffian wave functions. 

5.3.1 Homogeneous electron gas 

We benchmark our implementation of the backflow correlations on the homogeneous electron 
gas (HEG) system. For comparison purposes, we choose to study the HEG system of 54 un- 
polarized electrons in the simple cubic simulation cell with periodic boundary conditions (the 
system was studied before several times [64, 1 19, 120]). Due to the homogeneity of the problem, 
the backflow displacement in Eq. (164) has only one non-zero component: ^1'^. We conveniently 
choose u{r) to be expanded in the basis of several polynomial Pade functions to preserve cusp 
conditions with cutoff equal to half of the simulation cell. Finally, our Jastrow correlation factor 
contains only electron-electron terms. 

In Table 5 we show our results for the foUowing three electron densities of = 1, 5 and 
20. The First important result is that at the normal density (r^ — 1), the size of correlation (e.i., 
Edmc — Ehf) is relatively smaU, but at lower densities (larger rg), the correlation energy is 
larger fraction of total energy and becomes more important. Second, we observe that the HF 
and Slater-Jastrow (SJ) fixed-node DMC energies are in very good agreement with previous 
results [64, 119, 120]. However, due to the omission of higher order correlations from Jastrow 
factor and also from backflow displacement, we obtain the expected higher VMC energies and 
variances for SJ and backflow displaced SJ (SJBF) trial wave functions. Nevertheless, our fixed- 
node DMC energies for SJBF trial wave functions closely match the results of Kwon et a/. [1 19], 
and only sUghtly deviate at = 20 from results of Rios et a/. [64]. 

5.3.2 Carbon atom and dimer 

The backflow correlations in single-determinant Slater-Jastrow (SJBF) [63, 64, 117] as well as 
in multi-determinant Slater-Jastrow (CIBF) [118] trial wave functions were recently applied to 
chemical systems. We extend these studies by including the backflow into the Pfaffian-Jastrow 
(PFBF) pairing wave functions. Below is a brief discussion of our implementation and results 
for carbon atom and dimer systems. 

We employ the inhomogeneous backflow given by Eq. (163), with the functions u and wi 2 
are allowed to be spin-dependent. The x and u functions are expanded in the 1 1 Gaussian basis 
functions, while the three-body functions wi 2 were limited to a product of 4 x 4 x 4 Gaussians. 

The numerical results are summarized in Tables 6 and 7. The backflow correlations are able 
to capture additional few percent of the correlation energy for both Slater-Jastrow and Pfaffian- 
Jastrow wave functions. Another important feature of backflow is 20-30% decrease in variances 
of local energy with respect to the wave functions without backflow correlations. We find that 
for the fully optimized backflow, the spin-unlike electron-electron functions are almost order 
of magnitude larger than spin-like ones as well as electron-nucleus functions. The gains are 
systematic with increasing number of parameters, however we do not find the three-body terms 
as important as reported in previous study [64]. This difference can be attributed to two main 
reasons — we use a different basis to expand the three-body fimctions wi^2 and we also eliminate 
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Tab. 5. VMC and fixed-node DMC energies per electron and variances of local energies for various trial 
wave functions (S, Slater; SJ, Slater-Jastrow; SJBF, backflow correlated SJ) for 3D unpolarized HEG of 54 
electrons. 





Method 


WF 


£;/Af[a.u./electron] 


o-^[a.u.^] 


1.0 


HF 


S 


0.56925(2) 


19.3(1) 




VMC 


SJ 


0.53360(4) 


1.26(4) 






SJBF 


0.53139(4) 


0.81(4) 




DMC 


SJ 


0.53087(4) 


- 






SJBF 


0.52990(4) 


- 


5.0 


HF 


S 


-0.056297(7) 


0.776(4) 




VMC 


SJ 


-0.075941(6) 


0.0636(1) 






SJBF 


-0.078087(4) 


0.0236(1) 




DMC 


SJ 


-0.07862(1) 








SJBF 


-0.07886(1) 




20.0 


HF 


S 


-0.022051(2) 


0.0482(1) 




VMC 


SJ 


-0.031692(2) 


0.000711(4) 






SJBF 


-0.031862(1) 


0.000437(1) 




DMC 


SJ 


-0.031948(2) 








SJBF 


-0.032007(2) 





atomic cores by pseudopotentials. It is plausible that for systems with core electrons the three- 
body correlations are more important due to the strong variations of orbitals close to the nucleus. 

Finally, let us discuss the difference between the two systems with respect to missing cor- 
relation energy. For the C atom, we have shown previously [1] that less than 100 determinants 
gives more than 99% of correlation energy {Ecorr)- The C dimer's fixed node errors are more 
pronounced, since the 148 determinants with re-optimized weights give only 97.5(1)% in a close 
agreement with recent calculations by Umrigar et flZ.[84]. Employing backflow correlations for 
our 148 determinant Cl-Jastrow wave function gives no apparent gain in E^orr except for de- 
crease in the variance of local energy. The improvement for the Pfaffian-Jastrow wave function 
is also very modest (less than 1%). Our results suggest that to reach beyond 99% of correlation 
one still needs complicated multi-reference wave functions, even after including quite general 
forms of the backflow correlations. 



5.4 Conclusions 

We have benchmarked our backflow correlated wave functions on the HEG system and achieved 
high level of agreement with previous results. Further, we have applied the Pfaffian pairing wave 
functions to chemical systems. The results for to testing cases of carbon pseudo atom and its 
dimer show promising gains in correlations energies, decreases in variances and improvements 
in the nodal structure. 
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Tab. 6. VMC and DMC energies and variances of local energy for Slater-Jastrow (SJ), Pfaffian-Jastrow 

(PF) and Cl-Jastrow (CI) trial wave functions with backflow (BF) correlations for C atom. Notation for 
backflow parameters: 2B, electron-nucleus and electron-electron terms; 3B, all electron-electron-nucleus 
terms; 23B for all terms together. 



Method 


WF 












E [a.u.] 


0-^ [a.u.^] 


'^corr [^^l 


HF 


S 


- 


- 


- 


- 


- 


-5.31471 


- 


0.0 


VMC 


SJ 


- 


- 


- 


- 


- 


-5.3990(1) 


0.0677 


81.8(1) 




SJBF2B 


11 


22 


- 


- 


33 


-5.4011(2) 


0.0544 


83.8(2) 




SJBF3B 


- 


- 


128 


128 


256 


-5.4023(3) 


0.0504 


85.0(3) 




SJBF23B 


4 


8 


128 


128 


268 


-5.4020(2) 


0.0498 


84.7(2) 




PF 












-5.4083(2) 


0.0626 


90.8(2) 




PFBF2B 


11 


22 






33 


-5.4097(1) 


0.0427 


92.1(1) 




PFBF23B 


4 


8 


128 


128 


268 


-5.4107(1) 


0.0411 


93.1(1) 




CP 












-5.4127(1) 


0.0447 


95.0(1) 




CIBF2B 


11 


22 






33 


-5.4131(3) 


0.0427 


95.4(3) 




C1BF23B 


4 


8 


128 


128 


268 


-5.4140(1) 


0.0342 


96.3(1) 


DMC 


SJ 












-5.4065(3) 




89.0(3) 




SJBF2B 


11 


22 






33 


-5.4090(3) 




91.5(3) 




SIBF3B 






128 


128 


256 


-5.4085(3) 




91.0(3) 




SJBF23B 


4 


8 


128 


128 


268 


-5.4094(3) 




91.8(3) 




PF 












-5.4137(3) 




96.0(3) 




PFBF2B 


11 


22 






33 


-5.4145(3) 




96.8(3) 




PFBF23B 


4 


8 


128 


128 


268 


-5.4152(3) 




97.5(3) 




CI 












-5.4178(1) 




100.0(1) 




CIBF2B 


11 


22 






33 


-5.4177(3) 




99.9(3) 




CIBF23B 


4 


8 


128 


128 


268 


-5.4174(2) 




99.6(2) 


Est. 


Exact 












-5.417806 




100.0 



'Wave function consists of 100 determinants re-optimized in VMC. 
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Tab. 7. Slater-Jastrow (SJ), Pfaffian-Jastrow (PF) and Cl-Jastrow (CI) wave functions with backflow (BF) 
correlations for C dimer. Notation is the same as in Table 6. 



Method 


WF 












E [a.u.] 


(7^ [a.u.^] 


-^corr [^] 


HF 


S 


- 


- 


- 


- 


- 


-10.6604 


- 


0.0 


VMC 


sr 


- 


- 


- 


- 


- 


-10.9936(4) 


0.179 


81.7(1) 




SJBF2B 


11 


22 


- 


- 


33 


-11.0012(3) 


0.144 


83.5(1) 




SJBF23B 


4 


8 


128 


128 


268 


-11.0014(2) 


0.141 


83.6(1) 
















-11.0171(2) 


0.160 


87.4(1) 




PFBF2B 


11 


22 






33 


-11.0223(3) 


0.123 


88.7(1) 




PFBF23B 


4 


8 


128 


128 


268 


-11.0223(2) 


0.128 


88.7(1) 




CP 












-11.0420(4) 


0.112 


93.6(1) 




CIBF2B 


11 


22 






33 


-11.0440(3) 


0.100 


94.0(1) 




CIBF23B 


4 


8 


128 


128 


268 


-11.0438(3) 


0.123 


94.0(1) 


DMC 


SJ 












-11.0227(2) 




88.8(1) 




SJBF2B 


11 


22 






33 


-11.0269(4) 




89.9(1) 




SJBF23B 


4 


8 


128 


128 


268 


-11.0280(3) 




90.1(1) 




PF 












-11.0419(9) 




93.5(2) 




PFBF2B 


11 


22 






33 


-11.0443(6) 




94.1(2) 




PFBF23B 


4 


8 


128 


128 


268 


-11.0447(3) 




94.2(1) 




CI 












-11.0579(5) 




97.5(1) 




CIBF2B 


11 


22 






33 


-11.0580(4) 




97.5(1) 




CIBF23B 


4 


8 


128 


128 


268 


-11.0585(5) 




97.7(1) 


Est. 


Exact 












-11.068(5) 




100.0 



^Slater determinant contains PBE DFT orbitals. 

''Same PBE DFT orbitals are used also in PF wave function. 

"^Uses natural orbitals with weights of the 148 determinants re-optimized in VMC. 
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6 Example of Large Scale Application: FeO Solid 

As an example of calculations, which are feasible at present using available both software and 
hardware resources, we briefly report our recent study of the FeO solid. 

FeO solid is a very interesting system which has been calculated a number of times before 
by several approaches. Nevertheless, it is difficult to claim that the problems and research ques- 
tions have been all "solved" and, in fact, our understanding of this and many other transition 
metal compounds is still rather limited. FeO belongs to the group of other paradigmatic transi- 
tion metal (TM) oxides such as MnO, CoO and NiO. All of these have a rather simple rocksalt 
(NaCl type) structure denoted as B 1 , with very small deformations at lower temperatures result- 
ing from effects such as magnetostriction. Since the transition elements in these systems have 
open d-shells, such solids should be nominally metals. However, for a long time it has been 
known and experiments have shown that these systems are large gap insulators. In addition, 
they exhibit antiferromagnetic ordering with the Neel temperatures of the order of a few hun- 
dred Kelvins. Interestingly, the gap remains present even at high temperatures when there is no 
long-range magnetic order. How is it possible? Using qualitative classification introduced by 
Zaanen and coworkers[121], there are basically two relevant and somewhat competing mecha- 
nisms: Mott-Hubbard and charge transfer. In the Mott-Hubbard picture, the gap opens because of 
large Coulomb repulsion associated with the double occupancy of the strongly localized d-states. 
The spin minority bands are pushed up in the energy, leading to gap opening. In the charge trans- 
fer mechanism, 4s electrons of the transition metal atom fill the unoccupied p-states of oxygen. 
The electronic structure has an almost ionic TM+^O^^ character, with resulting gap opening as 
well. In the real materials both of these mechanisms are present and therefore the systems exhibit 
insulating behavior. The antiferromagnetism happens on the top of this and is caused by weak 
super-exchange interactions between the moments of neighbouring transition metal atoms medi- 
ated by bridging oxygens. We will not go into further details, suffice it to say that the electron 
correlations are very large and crucially affect many properties of these materials. 

If one applies standard DFT approaches to FeO, the results are highly unsatisfactory. First, 
the atomic structure which comes out is not correct. Instead of the rocksalt antiferromagnet 
another structure, so-called iB8, appears to have the lowest energy at equilibrium conditions. 
Interestingly, iB8 is actually a high pressure phase of FeO. This discrepancy is surprising since 
DFT methods typically provide correct equilibrium structures and very reasonable geometric pa- 
rameters such as lattice constants and others. Another problem appears in the electronic structure 
since for the correct geometry DFT predicts a metallic state. More sophisticated methods beyond 
DFT have been applied to this system in order to reconcile some of the results with experiments, 
nevertheless, a number of questions remain unanswered. For example, at high pressures, FeO 
undergoes a structural transition into the iB8 phase, however, the value of the transition pressure 
which agrees with experiment is difficult to obtain using the mainstream approaches. Similar 
transition appears in MnO, where recent bechmarking of several DFT approaches provided tran- 
sition pressure estimates between 65 and 220 GPa, i.e., more than 300% spread in the predictions 
[122]. Large discrepancies with expriment exist also for the CoO crystal and more complicated 
transition metal compounds. 

We have carried out QMC calculations of FeO using supercells with periodic boundary con- 
ditions to model the infinite solid [123J. Several supercell sizes were calculated and k-point 
sampUng of the BriUouin zone was carried out by the so-called twist averaging with the purpose 
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Tab. 8. Comparision of the calculated structural properies of FeO solid in DFT and in the fixed-node DMC 
with experimental data. The energy difference Eiss-Esi is evaluated at the experimental lattice constant 
4.334 A. 



Method/quantity 



DFT/PBE FN-DMC 



Exper. 



EiBs-Esi [eV] 
cohesion energy [eV] 
lattice constant | Aj 
bulk modulus [GPa] 
band gap [eV] 



-0.2 [126] 

~ 11 
4.28 [124] 
180 [124] 

~0 



0.5(1) 
9.66(4) 
4.324(6) 
170(10) 

2.8(3) 



> 
9.7 [127] 
4.334[128J 
- 180 1125] 
2.4 [129] 



of eliminating finite size effects. The core electrons were replaced by pseudopotentials for both 
Fe (Ne-core) and O (He-core). The largest simulated supercells had more than 300 valence elec- 
trons and the total energies were sizeable due to the presence of "semicore" 3s and 3p states of 
Fe in the valence space, what have made the calculations rather demanding. The wave ftmction 
had the Slater-Jastrow form and the orbitals were obtained from unrestricted (spin-polarized) 
calculations within DFT with hybrid functionals and HF. 

In Tab. 8 are shown the QMC calculated equilibrium parameters. Note that QMC identifies 
the correct equilibrium structure and provides very accurate value of the cohesive energy. Cohe- 
sive energies are very difficult to calculate since the DFT methods show typical bias of 15-30%. 
At present, there is basically no other method besides QMC which can get this level of accuracy. 
Note also good agreement with experiments for the other quantities including the band gap. In 
QMC the band gap is calculated as a difference of two total energies — ground state and excited 
state, where the excited state is formed by promoting an electron from the top valence band into 
the conduction band. 

In addition, the equations of state have been calculated for both the equilibrium structure and 
also for the high pressure phase, see Fig. 16. The estimated transition pressure is 65(5) GPa, at 
the lower end of the experimental range 70-100 GPa [124, 125]. 

Finally, considering that only a simple trial wave function was employed, the results are 
remarkable and very encouraging. Note that the calculations do not have any free non- variational 
parameters. It is simply the best possible solution within the trial function nodes and the given 
Hamiltonian. 
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14 16 18 20 22 24 
V (A^/FeO) 

Fig. 16. FN-DMC Energy as a function of volume for FeO for Bl (red squares) and iB8 (blue cir 
phases. Lines are fits with Mumagham equation of state. 
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Method 
(VMC) 



System 
(Molecule) 



Most Abstraction 




Wavefunction_data 
(Multiply) 



Dynamics generator 
(Delayed Rejection) 



Wavefunction_data 
(Slater Determinant) 



Wavefunction_data 
(Jastrow factor) 



Molecular orbital 
evaluator(Linear) 




One-body Jastrow 



Two-body Jastrow 



Basisfunction 
(Gaussian) 



Basis_function 
(cutoff Fade) 



Basisfunction 
(cutoff Pade) 



T 

Least abstraction 



Fig. 17. Calculation structure for the VMC method on a molecule using a Slater- Jastrow wave function. 



7 QWalk: A Quantum Monte Carlo Program for Electronic Structure 

The solution of the electronic structure of tnany electron systems in quantum Monte Carlo re- 
quires the use of computers and appropriate software tools. For the practitioners of the QMC 
methods, the computational aspects such as the use of efficient algorithms, flexible code organi- 
zation and parallel efficiency are important, since they influence which types of calculations and 
which types of physical effects are feasible to study. This section is therefore aimed to introduce 
a reader to one of the existing QMC packages, Open Source QWalk code available under the 
GPL hcence [53] (www.qwalk.org). 

This section is organized as follows. In Sec. 7.1 we overview the program's organization and 
structure. The description of the implementation of wave function optimization is in Sec. 7.2. 



7.1 Organization and Implementation 

QwaUc ("Quantum Walk") is a computational package for QMC calculations of electronic struc- 
ture systems such as molecules and solids. However, others systems can be calculated such as ho- 
mogeneous electron gas, BCS-BEC condensates, ID rigns with persistent currents, etc. The code 
is written in a combination of object-oriented and procedural techniques. The object-oriented ap- 
proach is coarse-grained, creating independent sections of code that are written efficiently in a 
procedural fashion. It is extremely modular; almost every piece can be removed and replaced 
with another. For example, a contributor of a new module only has to change one line in the 
main code to allow its use. This allows for flexibihty while keeping the code base relatively 
simple and separable. The modular structure also allows for partial rewrites of the code without 
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Tab. 9. The central objects of the code and their physical correspondents 



Module name 



Mathematical object 



System 
Sample point 
Wave function type 
Wave function 
Dynamics generator 



parameters and form of the Hamiltonian 

R, the integration variables 
Wave function ansatz 

*t(R), V*t(R), V2*t(R) 
Metropolis trial move 
(Green's function) 



worrying about other parts. In fact, each major module has been rewritten several times in this 
manner as we add new features and re-optimize the code flexibility and performance. 

The modules form a tree of successive abstractions (Fig. 17). At the top of the tree is the QMC 
method, VMC in this case. It works only in terms of the objects directly below it, which are the 
concepts of System, Wave function data, etc. (see Table 9). These in turn may have further 
abstractions below them. The highest wave function object is of type "Multiply", which uses 
two wave function types to create a combined wave function. In this case, it multipUes a Slater 
determinant with a Jastrow correlation factor to form a Slater- Jastrow function. Since various 
wave functions can be plugged-in, the Slater determinant can be replaced with any antisymmetric 
function. The same applies to the Jastrow factor. The type is Usted along with the specific instant 
of that type in parenthesis. At each level, the part in parenthesis could be replaced with another 
module of the same type. The example of the flowchart of the whole program is given in Fig. 19. 



7.2 Optimization Methods in QWalls 

Historically, the first implementation of minimization method in QWalk code was the OPTIMIZE 
method, based on the quasi-Newton minimizer. It has proved effective for variance minimization 
of linear and nonlinear Jastrow parameters, however, the energy minimizations require very large 
configuration samples. 

This deficiency was partially fixed by the OPT1M1ZE2 method, which uses modified Hes- 
sians and gradients of variance and energy of Ref . [74] together with the Levenberg-Marquardt 
minimization algorithm. The analytical gradients and Hessians with respect to selected vari- 
ational parameters were implemented (determinantal and orbital coefficients, but also Pfaffian 
and pair orbital coefficients). OPTIMIZE2 has proved to be very effective method and has been 
used for most of recent calculations. 

In an attempt to remove the dependence of OPT1M1ZE2 method on the fixed MC sample, the 
NEWTON OPT method was implemented as well. Its principal advantage is that the energy and 
variance are obtained from a small MC run performed after each optimization step. 

Besides the variance and energy minimizations, it is possible to minimize several other ob- 
jective or cost functions. Their incomplete list can be found in Table 10. As was recently pointed 
out by Umrigar and Filippi [74], the wave functions optimized by minimization of a mix of en- 
ergy and variance produce almost as low energy as energy optimized wave functions but with 
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Tab. 10. List of some useful cost functions for wave function optimizations with indicated implementations 
in QWalk [53]. 

Cost Function Minimized quantity QWalk implementation 

OPTIMIZE OPT1MIZE2 NEWTON_OPT 
Variance {{El - Ef) ^ 7 7 

Energy _ _ V V \/ 

Mixed xE + {I - x){{El - Ef) V V \/ 

Absolute value (|£'l - £^1) ^ 

Lorentz (ln(l + {El - Ef/2)) \j 

E 

J 'I'T*(-r->-oo) 

/-^^ 



Ratio 
Overlap 



lower variances. 

We illustrate this for the ground state wave function of the N2 molecule (see Fig 18). The em- 
ployed NEWTON-OPT method minimized 23 Jastrow parameters (the curvatures of correlation 
basis functions and all linear coefficients) of single determinant Slater- Jastrow wave ftinction 
on set of 56000 walkers. The minimized energy from mixed objective function is almost as 
good as from the energy optimization, while the corresponding dispersion a falls in between a 
from energy minimization and variance minimization. The wave functions obtained from mixed 
objective functions therefore appear as the the most efficient compromise for further DMC cal- 
culations. 

7.3 Conclusion 

QWalk represents a state of the art, usable, and extensible program for performing quantum 
Monte Carlo calculations on several types of quantum systems. It is able to handle medium to 
large systems of electrons; the maximum size is mostly limited by the available computer time. It 
works in parallel very efficiently, so it can take advantage of large clusters, multi-core computers, 
etc. Since QWalk is available without charge and under the GNU Public license, it is hoped that it 
will help bring both development and use of quantum Monte Carlo methods to a wide audience. 
Due to its modular form it is straightforward to expand the QWalk's applicability to quantum 
systems beyond the electron-ion Hamiltonians in continuous space. With some knowledge of 
C++ it is easy to modify the system module to incorporate other types of interactions and to 
expand the one-particle and pair orbitals using the coded basis functions. 
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10 20 30 40 

Iteration number 




10 20 30 40 

Iteration number 



Fig. 18. Energy E and dispersion of the local energy a of the N2 molecule versus the minimization step 
number. Notation: minimization of variance (green triangles), energy (blue circles) and 95% mixture of 
energy and 5% of variance (red squares). Upper figure: Energy versus the iteration number. The error bars 
are of the symbol size or smaller. Inset: the later iterations on expanded scale. Note the variance minimized 
energy being higher by almost 10 mH. Lower figure: Same as the upper figure but for the dispersion of 
local energy a rather than energy. Inset shows that the mixture minimized a falls in between a from energy 
minimization and variance minimization. 



M. Bajdich and L. Mitas 



Choose system, pseudopotential, 
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Generate one-particle orbitals 
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Fig. 19. Flow 



of a QMC calculation 
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8 Summary 

We have presented an overview of QMC methods and some of the recent developments in the 
analysis of the fermion nodes, constructions of pairing wave functions and explorations of back- 
flow correlations. Most of the apphcations were using small systems both because of educational 
purposes and in some cases because these were the first benchmarks of the new developments. 
QMC methodology has proved to be a very powerful technique for studies of quantum systems 
and also real materials as illustrated on the FeO solid calculations. In essence, QMC has a num- 
ber of advantages when compared with other approaches: 

- direct and explicit many-body wave function framework for solving the stationary Schrodinger 
equation; 

- favorable scaUng with the systems size; 

- wide range of applicabiUty; 

- the fixed-node approximation which enables to obtain 90-95% of the correlation effects; 

- scalability on parallel machines; 

- new insights into the many-body phenomena. 

The remaining fixed-node errors account for small, but still important fraction of correla- 
tions. As we have shown, further improvement caUs for better understanding of the nodes and 
their efficient descriptions. It is encouraging that studies of the nodal properties brought new 
understanding such as the two nodal cell property for generic fermionic states and revealed new 
research directions which are yet to be explored. 

We have shown that the pairing orbitals and Pfaffian wave functions show significant im- 
provements in providing more systematic results and higher accuracy. The key virtue of the 
pairing functions is that they exhibit the correct nodal topologies and as such are capable to de- 
scribe quantum condensates and pairing effects. As we explained, the backflow has perhaps more 
Umited scope of usefulness and appears to be effective especially for homogeneous or marginally 
inhomogeneous systems. Many generalizations of QMC exist. For example, for systems with 
magnetic fields and stationary currents the fixed-node methods can be generalized to the fixed- 
phase QMC [130, 131]. For finite temperature equilibrium properties Path Integral Monte Carlo 
has been used very successfuly especially for bosonic quantum hquids. 

The overall impact of the electronic structure QMC methods has been so far mainly in provid- 
ing high accuracy and benchmark results which were used by other methods as a reference. We 
believe that the impact of QMC methods is on the rise and their use will gradually become more 
routine. Already at present, there are cases for which developing elaborate mean-fields take more 
human time than direct solutions by many-body stochastic approaches. However, as we stated in 
the introduction, much needs to be done to make the methods more insightful, efficient and easy 
to use. 
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Quantum Monte Carlo (QMC) is an advanced simulation methodology for studies of many- 
body quantum systems. The QMC approaches combine analytical insights with stochastic 
computational techniques for efficient solution of several classes of important many-body 
problems such as the stationary Schrodinger equation. QMC methods of various flavors have 
been applied to a great variety of systems spanning continuous and lattice quantum models, 
molecular and condensed systems, BEC-BCS ultracold condensates, nuclei, etc. In this re- 
view, we focus on the electronic structure QMC, i.e., methods relevant for systems described 
by the electron-ion Hamiltonians. Some of the key QMC achievements include direct treat- 
ment of electron correlation, accuracy in predicting energy differences and favorable scaling 
in the system size. Calculations of atoms, molecules, clusters and solids have demonstrated 
QMC apphcabihty to real systems with hundreds of electrons while providing 90-95% of 
the correlation energy and energy differences typically within a few percent of experiments. 
Advances in accuracy beyond these hmits are hampered by the so-called fixed-node approx- 
imation which is used to circumvent the notorious fermion sign problem. Many-body nodes 
of fermion states and their properties have therefore become one of the important topics for 
further progress in predictive power and efficiency of QMC calculations. Some of our re- 
cent results on the wave function nodes and related nodal domain topologies will be briefly 
reviewed. This includes analysis of few-electron systems and descriptions of exact and ap- 
proximate nodes using transformations and projections of the highly-dimensional nodal hy- 
persurfaces into the 3D space. Studies of fermion nodes offer new insights into topological 
properties of eigenstates such as explicit demonstrations that generic fermionic ground states 
exhibit the minimal number of two nodal domains. Recently proposed trial wave functions 
based on pfafflans with pairing orbitals are presented and their nodal properties are tested 
in calculations of first row atoms and molecules. Finally, backflow "dressed" coordinates 
are introduced as another possibihty for capturing correlation effects and for decreasing the 
fixed-node bias. 
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1 Introduction 

Properties and behavior of quantum many-body systems are determined by the laws of quantum 
physics which have been known since the 1930s. The time-dependent and stationary Schrodinger 
equations describe molecules, condensed matter systems, surfaces and interfaces, but also arti- 
ficial structures such as quantum dots, nanowires, ultracold condensates and so forth. However, 
the task of solving the Schrodinger equation for systems of electrons and ions and predicting 
the key quantities of interest such as cohesion and binding energies, optical gaps, chemical reac- 
tion barriers, types of magnetic order or presence of quantum condensates seems to be beyond 
formidable. Indeed, Paul Dirac recognized this already in 1929: "The general theory of quantum 
mechanics is now almost complete. The underlying physical laws necessary for the mathemati- 
cal theory of a large part of physics and the whole chemistry are thus completely known, and the 
difficulty is only that the exact application of these laws leads to equations much too complicated 
to be soluble." Let us therefore try to understand the key obstacles. The first complication comes 
from the fact that a typical system of interest involves large number of particles. Even moderate 
size molecules or periodic supercells of solids contain from tens to thousands or more quantum 
particles. Second, the particles interact and for interesting problems the interactions are strong 
and influence the systems in profound ways. Third, the solutions have to conform to the required 
quantum symmetries such as fermionic antisymmetry and others. This is a fundamental depar- 
ture from classical systems and makes the problem particularly challenging. Fourth, the required 
accuracy for sensible predictions, which are comparable with experiments, is very high. 

Past studies of quantum systems were often based on ideas of reducing the interactions into 
various types of effective mean-fields. These approaches gradually developed into a high level of 
sophistication and despite limitations, they are still very useful and important on their own or as 
starting points for building new theories. At the same time, advent of computer technology has 
offered us a new window of opportunity for studies of quantum (and many other) problems. It 
spawned a "third way" of doing science which is based on simulations, in contrast to analytical 
approaches and experiments. In a broad sense, by simulations we mean computational models of 
reality based on fundamental physical laws. Such models have value when they enable to make 
predictions or to provide new information which is otherwise impossible or too costly to obtain 
otherwise. In this respect, QMC methods represent an iUustration and an example of what is the 
potential of such methodologies. 

Some of the ideas used in QMC methods go back to the times before the invention of elec- 
tronic computers. Already in 1930s Fermi noticed similarities between the imaginary time 
Schrodinger equation and stochastic processes in statistical mechanics. Apparently, scientists 
at the Los Alamos National Laboratory attempted to calculate the hydrogen molecule by a sim- 
ple version of QMC in the early 1950s, around the same time when a seminal work on the first 
Monte Carlo study of classical systems was published by Metropolis and coworkers. In the late 
1950s Kalos initiated QMC simulations for a few-particle systems and laid down the statistical 
and mathematical background of the Green's function Monte Carlo method. Pioneering QMC 
calculations of the correlation energy of homogeneous electron gas by Ceperley and Alder in 
1980 started a new era of stochastic methods applied to electronic structure problems. QMC 
methods were then further developed for quantum lattice models, quantum liquids, nuclear and 
other systems with contributions from many scientists. 

Currently, the term "quantum Monte Carlo" covers several related stochastic methodologies 
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adapted to determine ground or excited states or finite-temperature equilibrium properties for a 
variety of quantum systems. Note that the word "quantum" is important since QMC approaches 
differ in many ways from the Monte Carlo methods for classical systems. In this overview we 
present QMC methods which enable to solve the stationary Schrodinger equation for interact- 
ing Hamiltonians in continuous space. QMC reaches the high accuracy by a direct treatment of 
many-body effects and by employing stochastic computational techniques which enable to cap- 
ture many-body effects well beyond the scope of mean-field or analytical methods. Conceptually 
very straightforward is the variational Monte Carlo (VMC) which involves construction of trial 
(variational) wave functions and evaluations of corresponding expectation values by stochastic 
integration. More advanced approaches are based on projection techniques such as the diffu- 
sion Monte Carlo (DMC) which projects out the ground state from a given trial wave function. 
The method is formally exact, however, for fermions it suffers from a well-known fermion sign 
problem. The reason is that in using statistical methodology we tacitly assume that the sampled 
distributions are nonnegative. Unfortunately, for fermions this not true and in order to circumvent 
this obstacle many practical implementations of DMC employ the so-called fixed-node approxi- 
mation. This introduces the fixed-node error which is one of the focal points of this overview. 

Elimination or, at least, alleviation of the fixed-node errors is therefore of interest from both 
fundamental and practical point of views — since this is essentially the only approximation which 
is not deeply understood and also because its full control requires inordinate computational re- 
sources. At first, the nodal problem looks intractable since many-body wave functions nodes are 
complicated high-dimensional hypersurfaces determined by the full solution of the Schrodinger 
equation. However, study of nodal structures has revealed a number of important properties of 
fermionic states such as nodal domain topologies and their relationships to ideas in geometry and 
spectral theory of operators. Therefore, some of the latest research developments on fermionic 
wave function nodes will be presented and explained. 

Along the same line of the research, we outline current developments in construction of 
efficient and accurate wave functions which explore ideas beyond the traditional determinants of 
single-particle orbitals. Recently proposed Pfaffian wave functions built from pairing orbitals, 
which provide higher accuracy and also correct nodal topologies, will be explained [1,2]. In the 
subsequent part, a fermion coordinate transformation of the backflow type which was recently 
tested for inhomogeneous systems is introduced. We generalize its application to Pfaffian wave 
functions and illustrate the results on some of the first benchmarks of these ideas. 

Fixed-node QMC simulations are computationally more demanding by two to three orders of 
magnitude when compared to the mainstream electronic structure methods which rely on mean- 
fields treatments of electron-electron interactions. However, traditional mean-field theories often 
lack the needed accuracy and also require frequent re-tuning, sometimes just to qualitatively rec- 
oncile the results with experiments. Although at present, building of accurate many-body wave 
functions is far from being automatic and demands very significant effort as well, the results of- 
ten provide high accuracy and become benchmarks for other methods. For cohesive and binding 
energies, band gaps, and other energy differences the agreement with experiments is typically 
within 1-3% [3-5]. The computational cost of QMC increases basically as a third power of the 
system size, making calculations with hundreds of electrons tractable and large clusters [6] and 
solids [7] up to 1000 electrons have been already studied. Producing accurate "numbers" has 
its own merit and the predictive power is particularly valuable for experiments and for practi- 
cal purposes. However, even more important are new insights into quantum phenomena and 
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detailed analysis of theoretical ideas. Indeed, QMC is very much in the line of "it from bit" 
paradigm, alongside, for example, of substantional computational efforts in quantum chromody- 
namics which not only predict hadron masses but, at the same time, contribute to the validation 
of the fundamental theory. Similar simulations efforts exist in other areas of physics as well. 
Just a few decades ago it was almost unthinkable that one would be able to solve Schrodinger 
equation for hundreds of electrons in an explicit, many-body wave function framework. Today, 
such calculations are feasible using available computational resources. At the same time, much 
more remains to be done, of course, to make the methods more insightful, more efficient and 
their application less laborious. We hope this overview will contribute to the growing interest in 
this rapidly developing field of research. 



Quantum Monte Carlo 



The review is organized as follows: 

• The remainder of this Section provides brief introduction into several traditional methods 
of solving the stationary Schrodinger problem for electron-ion systems. 

• The next section follows with the description of the methodology behind the VMC and 
DMC methods. 

• Results of studies on exact and approximate nodes of fermionic wave functions are sum- 
marized in the Sec. 3. 

• Section 4 discusses generalized pairing wave functions and their properties. 

• Overview of preliminary results on the backflow wave functions is given in the section 5. 

• Section 6 shows the example of large scale QMC apphcation: FeO solid. 

• A QMC package Qwalk is introduced in Sec. 7. 

• The last section concludes with comments and ideas on current developments and out- 
looks. 

1.1 Many-Body Schrodinger Equation 

Let us consider a system of quantum particles such as electrons and ions with Coulomb inter- 
actions. Since the masses of nuclei and electrons differ by three orders of magnitude or more, 
let us simplify the problem by the Bom-Oppenheimer approximation which allows to separate 
the electrons from the slowly moving system of ions. The electronic part of non-relativistic 
Born-Oppenheimer Hamiltonian is given by 

where i and j label the electrons and while / sums over the ions with charges Z/. (Throughout 
the review we employ the atomic units, a.u., with = me = e = 1.) The basic properties of 
such systems can be foimd from eigenstates ^„ of the stationary Schrodinger equation 

H^^ = S„*„. (2) 



Colloquially, we call such solutions (exact or approximate) and derived properties as electronic 
structure. 

The history of electronic structure ab initio methods (see, for example, Ref. [8]) started 
soon after the discovery of quantum mechanics by pioneering works of Heitler and London [9], 
Hartree [10], and Hylleraas [11, 12]. An important step forward was made by Fock [13] who 
established the simplest antisymmetric wave function and formulated the Hartree-Fock (HF) the- 
ory which takes into account the Pauli exclusion principle [14]. The theory was further developed 
by Slater [15] and others and it has become a starting point of many sophisticated approaches not 
only in condensed matter physics or quantimi chemistry but also in other areas such as nuclear 
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physics. The HF theory replaces the hard problem of many interacting electrons with a system 
of independent electrons in an effective, self-consistent field (SCF). 

For periodic systems the effective free electron theory and the band theory of Bloch [16] 
were the first critical steps towards understanding real crystals. In 1930s, the foundation of 
the basic classification of solids into metals, semiconductors and insulators was laid down and 
later Wigner and Seitz [17, 18] performed the first quantitative calculation of electronic states 
in sodium metal. Building upon homogeneous electron gas model, the density functional theory 
(DFT) was invented by Hohenberg and Kohn [19] and further developed by Kohn and Sham [20]. 
Subsequent developments proved to be very successful and DFT has become the mainstream 
method for many applications which cover not only condensed systems but also molecules, sur- 
faces, even nuclear and other systems. DFT together with HF and post-HF methods are relevant 
for our discussion of quantum Monte Carlo, which uses the results from these approaches as a 
reference and also for building wave functions. What follows is their brief overview. 



1.2 Hartree-Fock Theory 

Due to the Pauli exclusion principle, any solution of the Schrodinger equation with Hamilto- 
nian (1) has to be antisymmetric imder coordinate exchange of any two electrons as 



(3) 



where i, j denote both spatial and spin degrees of freedom. The Hartree-Fock theory [13, 15] is 
based on the simplest antisymmetric wave function, sometimes called the Slater determinant. 



*(1, 2, . . . , TV) = (l)^i.(2) • . • 'Pi AN) 



(4) 



(^i(l) ^i(2) 

^2(1) ^2(2) 

<^jv(l) <^iv(2) 



<Pn{N) 



= det[^i(l),...,<^jv(A^)], 



to approximate the state of A^-electron system. are one-particle spin-orbitals, each being 

a product of the spatial (pf{j) and spin ai{(Tj) orbitals. (Note that ffiJ) is independent of spin 
<j in closed-shell cases. In open-shell systems, this assumption corresponds to the spin-restricted 
Hartree-Fock approximation). The Hartree-Fock total energy can be found from the following 
variational condition 



= min; {ipj\^j) = 1, j = 1, ...,N. 



(5) 
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If the Hamiltonian is spin-independent or it is diagonal in the spin basis a = {| t); | 4-)}) the 
expectation value of Hamiltonian (1) with the wave function (4) is given by 



E 



HF 



^-(r)^-(r')¥'J(rX(r') 



<(r)dr 

1 J2 /V¥'r*W'P.?*(r')^rWv'.?(r') 

- ^ E// dr dr'. (6) 

The expression for the HF energy contains three different terms. The first one represents one- 
particle contributions, the second is the Coulomb interaction also called the Hartree term and 
the last contribution is the exchange or Fock term. Note that for i = j (self-interaction) the last 
two terms explicitly cancel each other. Variation of the HF total energy leads to one-particle 
HF equations with the resulting spectrum {ip^ , Cj} of occupied and virtual orbitals obtained by 
solving 



-^V2 + yext + ^HF({<}) 



<(r) = ei^1{v) (7) 



The effective field operator Vrf contains both Coulomb and exchange terms and due to the self- 
consistency depends also on the set of occupied orbitals. The resulting operator is both nonlinear 
and nonlocal as manifested by the fact that Vrf, in general, differs for each state i. This compli- 
cated dependence makes the solution of HF equations rather involved. The solutions are usually 
expanded in an appropriate basis sets and iterated to self-consistency. Sophisticated and pow- 
erful tools such as well-developed and extensive quantum chemical or solid state packages are 
necessary for such solutions. For solids the HF equations were out of reach for a long time and 
only in late eighties the first package had emerged which was able to find reasonably accurate 
and converged solutions. 



1.3 Post Hartree-Fock Methods 

The HF energy is variational, i.e, above the exact eigenvalue, and whatever is left out is referred 
to as electronic correlation. For a given state, the correlation energy is therefore defined as a 
difference of the HF energy and the exact eigenvalue, 

Ecorr = Ehf — E. (8) 

In electron-ion systems Ecorr is only a small fraction of the total energy, of the order of ~ 1-3 
%, however, correlations impact almost all quantities in a major way. For example, correla- 
tion accounts for a large portion of cohesions, excitations, barrier heights and other important 
energy differences. Many other effects such as magnetism or superconductivity are driven by 
correlations so that its accurate description becomes crucial. Revealing the nature of correlation 
effects and how they influence the behavior of quantum systems is therefore one of the central 
challenges of modem electronic structure research. 
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The missing correlation in the HF theory can be accounted for by expanding the wave func- 
tion into linear combination of Slater determinants with one-, two-, A^-particle excitations. 
Providing the one-particle set of states is complete and all possible excitations are covered one 
can formally reach the exact solution. Unfortunately, the convergence of such methods is slow 
and the computational demands grow exponentially with the system size. Nevertheless, for small 
systems such methods are effective and provide valuable checks for other approaches. The most 
frequently used post-HF approaches are configuration interaction (CI), multi-configurational 
self-consistent field (MCSCF), and coupled cluster (CC) methods. The idea behind CI is to 
diagonalize the A/'-electron Hamiltonian on the space of orthogonal Slater determinants. These 
determinants are typically constructed as few-particle excitations from the reference determinant 
(usually the HF solution) using the virtual HF orbitals. The CI method thus provides the most 
optimal expansion on such determinantal space. The most commonly used is CI with all the 
single and double excitations (CISD). Besides the slow convergence in one-particle orbitals the 
method becomes problematic with increasing N since the recovered correlation is proportional 
to VN (instead of N), thus leading to the well-known size-consistency problem (i.e., a wrong 
scaling of total energy with N). 

The MCSCF is a generalization of the expansion in excited determinants and differs from 
CI by optimization of both of the orbitals and the determinantal coefficients. Such optimizations 
are quite difficult and the demands often limit the size of the problem which can be studied. The 
special case of the MCSCF is the complete active space SCF (CASSCF), which is size consistent, 
at the price of exponential scaling in the number of particles. 

The size-consistency problem can be overcome by going beyond linear expansions as it is 
done in the coupled cluster method [21]. The wave function ansatz is based on exponentiated 
excitation operators and the corresponding optinnized parameters are found by solving a set of 
coupled nonlinear equations. The computational cost for CCSD (with singles and doubles) scales 
as and therefore constitutes a formidable challenge for larger systems. The commonly used 
CCSD(T) (CC with single, double and perturbative triple excitations) is often used as a standard 
in accuracy for small systems. For an overview and detailed introduction into the above men- 
tioned quantum chemical methods we refer the reader, for example, to the book by A. Szabo and 
N. S. Ostlund [22]. 



Previous methods were examples of the wave function based theories. For the density functional 
theory (DFT) the primary object is the one-particle electron density, i.e., the diagonal part of 
one-particle density matrix. DFT is formally an exact method based on Hohenberg and Kohn 
theorem [19] that states that the ground state properties of a many-electron system can be ob- 
tained by minimizing the total energy functional 



where the F[/9(r)] is some unknown, universal functional of the electronic density p{v). Assum- 
ing such functional would be known, the total energy E would exhibit a minimum for the exact 
ground state density p(r). 



1.4 Density Functional Tlieory 




(9) 
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Kohn and Sham [20] introduced an ansatz for the density in terms of one-electron orbitals of 
an auxiliary non-interacting system as 



N 



p{T) = J2\^i{r)f. (10) 

i 

The total energy of electronic system can be then expressed as 

m^)] =-\Y.j WVVi(r) dr+ j p(r)Kxt(r) Av+\ j j ^^^^ AvAv'+E^M^)]. 

(11) 

where the first two terms represent the familiar energy of a non-interacting system in the ex- 
ternal potential, the third term being the Hartree term while the rest is an unknown universal 
exchange-correlation functional. By minimization of the total energy as a functional of normal- 
ized occupied orbitals one can derive the Kohn-Sham equations 



<f'({v) = ei^1{v). (12) 



The structure of the equations appears similar to the HF equations, the key difference is that 
the effective exchange-correlation operator is nonlinear yet local, since it depends only on the 
density but not on individual states. This feature of the DFT theory simplifies the calculations 
considerably and it has enabled to expand the DFT applicability to very large systems. If the 
functional £'a;c[/5(r)] would be precisely known, we would have one to one mapping between 
difficult many-electron system and an effective one-particle problem. The fundamental difficulty 
is that the exact exchange-correlation functional is unknown and therefore we have to rely on 
approximations. 

The simplest is the local density approximation (LDA), 

£.Jp(r)] = j e^,^«[p(r)]p(r)dr, (13) 

where e^'^^ is the exchange-correlation energy of the homogeneous electron gas of the same 
density. It is interesting that basically in all practical implementations of LDA, the correlation 
portion of this energy functional comes from the pioneering QMC calculations of the homoge- 
neous electron gas [23]. 

It is intuitively suggestive to generalize the exchange-correlation functional to include also 
the density derivatives, i.e., to express ea;c[p(r), Vp(r)] as a function of the local density and its 
gradient. This was the essential idea behind the generalized gradient approximation (GGA) [24— 
27], which provide improved results for some quantities and has enabled wide-spread use both 
in condensed matter physics and quantum chemistry. 

There are many flavors and generalizations of the density functional theory, e.g., hybrid- 
functionals [24, 26, 27], time-dependent DFT (TD-DFT) [28, 29] for calculations of excited 
states, extensions to non-local density fimctionals (averaged density approximation (ADA) [30] 
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and weighted density approximation (WDA) [30]), or orbital-dependent functionals (e.g. self- 
interaction corrections (SIC) [31] and LDAh-U [32]). For some systems the DFT methods are 
very effective and provide reasonable accuracy at very low cost. Unfortunately, there are many 
cases where DFT requires further elaboration: introduction of ad hoc parameters such as in 
LDAh-U approach or perturbative corrections such as in the so-called GW method. Even then 
for many systems such as transition metal compounds, weakly interacting van der Waals systems 
and other cases the DFT has profoimd difficulties in reaching the required accuracy. 
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2 Quantum Monte Carlo Methods 



2.1 Introduction 

Quantum Monte Carlo (QMC) is a suite of approaches based on two complementary strategies 
for solving quantum many-body problems. The first key component is an analysis of many-body 
effects at the wave function level: wave function analytical structure and impact of potential 
singularities, efficient functional forms, handling of symmetries and boundary conditions, ex- 
plicit multi-particle correlations and construction of intermediate effective Hamiltonians which 
enable to build-up more accurate many-body treatment. The second key ingredient is based on 
employing stochastic techniques whenever it is impractical or inefficient to use analytical and 
explicit constructions. This is done on several fronts: evaluations of high-dimensional integrals, 
solving of quantum equations by mapping onto equivalent stochastic processes, development of 
algorithms and numerical approaches for high accuracy and efficiency. 

Over the last three decades, the impact of QMC has been steadily growing as scientists ex- 
plored new algorithms and produced results which were otherwise impossible to obtain by tra- 
ditional methods. Currently, QMC covers a great variety of specialized approaches tuned to 
particular Hamiltonians and applications. In this review we will briefly describe the variational 
and diffusion quantum Monte Carlo methods. 

In the variational Monte Carlo (VMC) the expectation values are evaluated via stochastic 
integration over SA^ -dimensional space using trial (variational) wave functions constructed to 
capture the desired many-body phenomena. The variational theorem ensures that the expectation 
value of the Hamiltonian is, within the statistical error bar, a true upper bound to the exact ground- 
state energy. The results are determined by the choice and quality of the trial wave function 
construction and therefore are subject to a variational bias. The second described method, the 
diffusion Monte Carlo, removes most of the variational bias by projecting out the ground state 
component from any trial function with nonzero overlap within the given symmetry class. The 
fixed-node approximation enables to avoid the fermion sign problem and makes the method 
scalable to large sizes. 



2.1.1 Metropolis Sampling 

Basic Monte Carlo multi-variate integration approaches are based on the following simple idea. 
Assume that we are interested in finding an expectation value of j4(R) for a known non-negative 
distribution P(R) 

f ^(R)P(R)dR 

= /V(R)dR ' '''' 

where R is a position in dimensional space. This is a common expression for evaluation 
of observables encountered in classical or quantum statistical mechanics. If the dimension of 
space is large and the integral cannot be reduced to sums and/or products of low-dimensional 
integrals, its straightforward evaluation scales exponentially in the dimension d. This is easy to 
see by considering a simple rectangular integration rule with Ng grid points in each dimension. 
The total number of evaluations will be (Ng)"^ and therefore not feasible even for moderate 
values of Ng and d. The key advantage of stochastic methods is that they are immune to the 
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dimensionality problem under reasonable conditions which are easy to verify (see below). Let us 
first consider how to generate samples of the distribution V(R.). This is most conveniently done 
by the Metropolis algorithm [33] which generates a Markov chain of samples by a stochastic walk 
in the configuration R-space. The stochastic samples are sometimes colloquially referred to as 
random walkers. The algorithm enables us to sample any desired multi-dimensional probability 
distribution ■P(R) without a prior knowledge of its normalization. Given some transition rule 
P(R R') from R to R', which satisfies ergodicity and detailed balance, 

P(R H!)V{Ii) = P(R' R)P(R'), (15) 
the desired probability density ■P(R) will converge to an equiUbrium state given by 



/ 



P(R-^R')?'(R)dR = P(R'), (16) 



see, for example, [34]. The transition rule P(R — > R') can be further written as a product of the 
sampUng distribution T(R R') (typically Gaussian distribution centered around R) and the 
probability of an acceptance A(R R') of the proposed step 

P(R^R') = r(R-^R')^(R-^R')- (17) 

The conditions of detailed balance are satisfied by choosing the A(R — )• R') to be 

T(R' -> R)-P(R')' 
T(R ^ W)ViIi) 



A(R R') = min 



(18) 



In this manner, the Markov chain initialized at some random state evolves towards an equilibrium. 
After some initial convergence and equilibration period we can start to collect the statistics. 
Assuming that we can apply the central limit theorem, the usual estimators for the mean, variance 
and error of the mean are given by 

1 ^ 

= ^Y.^^^rn)+0{l/^), (19) 

m 

M 



CTA 



M 

where M represents the number of statistically independent sampling points. 

The computational demands depend on the desired error and on the variance a\. Seem- 
ingly, the estimators and the error bar do not depend on the dimensionality d of the problem. In 
general, however, one needs to take into account the impact of increasing d for the following: i) 
amount of calculation needed to obtain statistically independent sample of P(R) and for evalu- 
ation of j4(R); and ii) growth of variance u\. Providing all of these grow as some polynomials 
of d, the method guarantees the overall polynomial scaling of the algorithm. In its basic form the 
expression for computational demands is then 

_ 

^comp — ~2~ '^sample.'! (^0) 

^ A 
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where Tsampic lumps the time needed to obtain statistically independent sample and to evaluate 
required quantities. 

We note that it is often possible to decrease the variance by the importance sampUng methods 
and thus increase the efficiency as explained, for example, in Ref. [34]. 



2.2 Variational Monte Carlo 

Let us consider a trial variational wave function ^t(R-) which approximates the lowest eigen- 
state of Ti, for a given symmetry. We will elaborate on constructions and properties of '^i in 
Sec. 2.4. In variational Monte Carlo (VMC) the expectation values for the given 4't(R) are 
evaluated by stochastic sampling. For example, the total energy is given by 

/vi/^(R)7^vl/T(R)dR /|vl/r(R)pigL(R)dR ^ 
EvMC= j^^(R)^^(R)dR = /|*T(R)PdR ^^^^ 

where we have introduced the so-called local energy 

- ^^^^ 
The VMC estimator Evmc is then given as 

1 ^ 

Evmc = j^Y.^Li^) + 0{l/sfM), 

m 

assuming the set of M configurations is distributed according to P(R) = |\E't(R)P. 



(23) 



2.2.1 Correlated Sampling 



Correlated sampUng exploits the fact that the variance of difference of correlated random vari- 
ables X and Y decreases as their correlation increases. For example, consider variational energy 
energy difference for two close wave functions '^^^ and e.g., for different ionic positions 
or with small external field when evaluating responses. The energy difference can be written as 



El — E2 



e[^\r) dR 



E^^'iH) dR 



/|*«(R)PdR 



/l4^)(R)|2dR 



*W(R)| = 



,(1) 



_/|*^'^(R)|2dR /u;(R)|*^'^(R)|2dR 



w{R)E<f\R) 

(1)^ 



dR, 



(24) 



where the re-weighting factor is given by ^(R) = |^'y^(R)p/|\l/^^'(R)p. Therefore, we can 
obtain the above energy difference as a sum over weighted local energy differences. The esti- 
mations can be carried out using the set of samples drawn, for example, from the distribution 
(^'^•')^. There are other possibilities such as sampling |\l'^^''\l'y ^ | and modifying the expres- 
sions accordingly. By taking the difference, the noise mostly cancels out, since for very similar 
wave functions the random fluctuations will be very similar and the weights will be close to 
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unity. Consequently, the variance of the difference can be much smaller than the variance of 
each individual quantity alone. The actual decrease in variance depends on particular application 
but gains can be two orders of magnitude or more. The algorithm can be further generalized for 
more complicated estimators and has been developed also for the diffusion Monte Carlo method 
presented below [35]. 



2.3 Diffusion Monte Carlo 

2.3.1 Imaginary Time Schrodinger Equation 

Diffusion Monte Carlo method belongs to a class of projection and Green's function QMC ap- 
proaches. Let us consider the imaginary time Schrodinger equation 

-^^'^ = (^-^T)*(R,r), (25) 

where r is a real variable (imaginary time) and Et is an energy offset. The effect of imaginary 
time is an exponentially fast projection of the lowest state of a given symmetry from any trial 
function with non-zero overlap. This can be easily seen by expanding ^'(R, r) in eigenstates 
{$„} of W so that for time t ^ oo one finds 

lim ^(R.r) = lim y($„|*(0)) e-^^^"-^^^ $„(R) (26) 

r^OO T—^OO ' ^ 

n 

= lim ($o|*(0)) e-"(^»-^^)$o(R)- 

r— >oo 

Although the absolute ground states of the Hamiltonians we consider are bosonic, by keeping 
the wave function in the antisymmetric sector, one can find the lowest antisymmetric state. The 
parameter Et can be adjusted to E^ so that the exponential weight of ^ becomes asymptotically 
constant. It is convenient to rewrite the equation (25) into into an integral form using the Green's 
function 

G(R ^ R',t) = (R| e-^(«-^^)|R'). (27) 
Then after propagation by a time interval Ar the wave function is given by 

*(R,t + At)= ^ G(R^R',Ar)*(R',T)dR'. 



(28) 



and the propagation can be continued by iterations. 

Let us first consider the Schrodinger equation with non-interacting particles 

The equation has a familiar form of diffusion in R-dimensional space. Assuming that we have 
N particles in a 3D space the corresponding Green's function is a SiV— dimensional Gaussian 
with variance = r. The equation describes a well-known process of diffusing particles in 
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a random noise medium with the diffusion constant equal to 1/2. The key departure from tra- 
ditional methods of solving such differential equations is in the solution representation. The 
quantum amplitude ^'(R, r) is described by sampling, i.e., as a density of random walkers (delta 
functions in R— space) 

M 

*(R, r) = dens[^ 6{R - R™(r))] + 0{1/Vm), (30) 

m=l 

where by dens we denote the density estimator, which can be carried out by appropriate binning 
of the walker ensemble. The function is therefore represented only statistically so that the values 
are known up to the statistical uncertainty term as indicated above. If we substitute the ensemble 
of delta functions into the integral Eq.(28), we obtain Gaussians of width r centered at the loca- 
tions {Rm}- In order to restore the walker representation back we then sample the Gaussians, 
i.e., each walker evolves to a new position as given by 

R„(r + Ar) =R„(r) + \/A7x, (31) 

where % is a SA'^-dimensional vector of independent random numbers with Gaussian distribution 
and unit variance. The process is iterated until the long-time limit r — > oo is reached. 

Considering now more interesting Hamiltonian with external potential and interaction terms 
represented by V, one can find the following short-time approximation [36] of the Green's func- 
tion using the Trotter-Suzuki formula as 

G(R ^ R', r) = (R| c-"(^+^-^^) |R') (32) 



(27rr)-3^/2exp 



/\2 



(R - R Q 
2t 



exp 



V(R) + V(R') 



-E' 



The approximation is of the order of 0{t^). For sufficiently small time step we can propagate 
the walkers even for Hamiltonians with complicated local interactions. The last term shows 
that the propagation involves two processes: Gaussian diffusion caused by the kinetic energy 
operator, as in the case of free particles, and reweighting, which originates from the presence 
of the potential. Note that the weights will vary very strongly whenever the potential varies. 
This is true, in particular, close to any potential singularities and makes such process very noisy 
and inefficient. In order to make the sampling more efficient we need to smooth out the terms 
entering the weights and to modify the the algorithm. 

2.3.2 DMC Importance Sampling 

Let us introduce importance sampling [36-38] Instead of sampling the wave function \I'(R, r) 
we will sample a mixed distribution /(R, r) = 'J't(R)^(R, t), where ^'t(R) is some trial 
function. Multiplying the Schrodinger equation with '^t and rearranging the terms leads to the 
following equation for the mixed distribution / 

_ c?/(R,t) ^_ lvV(R,r) + V- [vc(R)/(R,t)] (33) 

+ (^i(R)-i;T)/(R,T), 
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where (R) = V In | "fx (R) | represents a drift velocity term and El (R) is the local energy 
given by Eq. (22). The corresponding short-time approximation of the Green's function is given 
by 



G(R ^ R', A.) «(2.A.)-3-/^ exp " ^' " ^^^^^^'^^ 



2At 

. f El{R) + ElCR') ^ 

X exp -At ( ^ — — ^ — - - E 

= Gd(R.^ R', At) X Gb(R ^ R', At) 



(34) 



Here, we can again interpret the action of the short-time Green's function as a diffusion-drift 
process evolved according to Gd and the re- weighting process Gb controlled by the average 
of local energies at R, R' and Et- This transformation has several implications. First, due to 
the drift, the density of walkers is enhanced in the regions where \E't(R) is large. Second, the 
re-weighting term contains the average of local energies, which for ^'y approximating ^fo, is 
close to a constant and therefore much better behaved than possibly unbounded potential 1^(R). 
By a proper choice and construction of 'ifx one can eliminate large amount of statistical noise, 
potential singularities, etc. For any large-scale calculation accurate is important and improves 
the efficiency by two orders of magnitude or more. 

In actual simulations, the walkers are initially distributed according to /(R, 0) = |^'t(R)P 
with the weights of all walkers set to unity. The propagation then proceeds by iterative application 
of the Green's function. First, the position of each walker evolves according to the diffusion- 
drift Gd from Eq. (34). However, accuracy of Go is not uniform and decreases in regions with 
strongly varying drift. In order to correct for this finite time step bias the walker move is accepted 
with the Metropolis probabiUty 

A(R^R0.mi„fl,S5ffi^iMSl(?O!!y ,35) 
V Gi,(R^R')I*t(R)IV 

The time step At is chosen to be small so that the acceptance in the DMC method is high, typi- 
cally 99% or so. The Metropolis step improves the Green's function approximation appreciably 
and decreases the time step bias due to the fact that it enforces the detailed balance condition 
which is fulfilled by the exact Green's ftmction. 

After evolving the walker position we need to update the walker weight according to 

Wm{T + Ar) = Gb (Rm ^ R^ , At) w„ (t) . (36) 

Note that over the course of the simulation some weights will grow very rapidly due to the 
exponential nature of the weight product. As a result, in the long-time limit, the weight of a single 
walker will exponentially dominate the rest. It is therefore necessary to introduce importance 
sampling also for the weights and adjust the walker population to improve the efficiency of the 
reweighting. One possible way how to accomplish this is stochastic branching and elimination 
of walkers. Consider a walker evolving from R'„, to Rm according to Goi^m ^ R'n)- The 
weight for the step becomes Wm = G_b(Rto R-m)- We can now calculate rim = iiit[?7 + 
w], where is a uniform random number from the interval [0, 1], and int is the integer part. 
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The resulting integer rim is the the numher of walker copies placed at the position R„, and 
subsequently evolved as independent walker(s). In case rim = 0, the walker is eliminated from 
the ensemble. In this manner, the branching/eUmination step enables to keep all the weights 
= 1, = 1, Af throughout the simulation. More elaborated approaches for sampling the 
weights can be found elsewhere [39]. 

After an equilibration period, we can start to collect the statistics needed for the calculation 
of the projected ground state energy 

_ /^*(R,r)?^^T(R)dR 

J^DMC = hm ^.T, /T-.N ') 



M 

or other quantities. 



/«'*(R,t)*t(R) dR 
^ Jf{R,T)EL{R)dR 
r'^ //(R,r)dR 

-^^i^4R„) + 0(l/VM) 



2.3.3 Fixed-Node Approximation 

For simphcity, we assume that the Hamiltonian has the time reversal syrmnetry (i.e., no magnetic 
field, induced currents, etc) so that the states can be chosen real. For bosonic systems, the ground 
state wave function has no nodes and is positive everywhere. However, the fermionic wave 
function is by definition antisyrmnetric, and therefore can no longer be represented by the walker 
distribution, which has to be non-negative everywhere. Clearly, for fermions, the wave function 
will have both positive and negative amplitude domains defined by the nodes — subsets of the 
configuration space where the wave function vanishes, i.e., \E'(R) = 0. (The nodal properties of 
wave functions will be studied in detail in the subsequent section.) 

We could try to overcome this problem by decomposing ^(R, r) into positive functions 
(R, r) and (R, r) and assign each walker a positive or a negative weight accordingly 

*(R,r) = *+(R,T)-*-(R,r). (38) 

However, since the Schrodinger equation is linear, both (R, t) and (R, r) independently 
converge to the same bosonic distribution, hence making the fermionic component to decay in 
time (see Fig. 1). 

This is a well-known fennion sign problem where the fermionic "signal" disappears expo- 
nentially quickly in the projection time. The method is possible to use providing one can esti- 
mate quantities of interest before the efficiency deteriorates, nevertheless, for large systems the 
method becomes very inefficient. The inefficiency can be eliminated at the price of introducing 
the so-called fixed-node approximation [36, 40-42], which restricts the evolution of positive and 
negative walkers within the separate regions defined by the sign domains of the trial wave func- 
tion 'I't(R). This is achieved by enforcing the mixed distribution /(R, r) = ^t(R-)'S'(R, t) to 
be non-negative everywhere 



/(R,t)>0. 



(39) 
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x=0 




Fig. 1. Imaginary time beliavior of the walker distributions ^' (red) and 'I'" (blue). Upper figure: at the 
beginning of the propagation (r = 0). Lower figure: after some imaginary time evolution. The fermionic 
signal decays as (^f^ — ^f") oc exp[— r(i5o^ — Eq)], where Eq — Eq is the difference of fermionic and 
bosonic ground state energies. 



In other words, ^'(R, r) shares the same zeros and sign structure as ^t(R-) for all r, so that 
Eq. (39) holds. This is an important conceptual change in the sampling process: the fixed-node 
approximation replaces the (nonlocal) antisymmetric condition by the local boundary condition 
in Eq. (39). Note that the drift term diverges at the node 4't(R) = and therefore repels the 
walkers away from the nodal regions. The fixed-node approximation has several appealing prop- 
erties. The change of symmetries into boundaries effectively bosonizes the fermionic problem 
and enables us to apply the statistical machinery to solve the evolution equation. The sketch of 
the fixed-node DMC in Fig. 2 reminds us Feynman path intergrals and the walker paths really 
are closely realated to quantum paths except that they happen in imaginary instead of real time. 

The fixed-node solution provides the lowest possible energy within the given boundary con- 
ditions and as such is a rigorous upper bound to the true ground state energy [36, 42]. This result 
also implies that resulting fixed-node energy bias is proportional to the square of the nodal dis- 
placement error and in actual applications the method has proved to be very successful as further 
elaborated later. 

The basic form of the fixed-node DMC method is conceptually rather straightforward. Let 
us therefore sketch a simple version of the algorithm. We assume that 4't(R) approximates the 
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desired groimd state and conforms to the required symmetries. The algorithm is as follows. 

(1) Generate a set of walkers {'Rm}m^i distributed according to for example, by using 
the VMC method. 

(2) Evaluate the drift v(R) = Vr In |*t(R)| for each walker. 

(3) Evolve each walker from its position R' to a new position R according to 

R R' + Atv(R) + X, (40) 

where x is a 3N-dimensional vector with components drawn from the normal distribution 
with zero mean and variance At. 



(4) Accept the move with the probabiUty 
P„ec(R ^ R') = min 



' Gi,(R'^R)*2.(R/)''-' 



(41) 



where ,s = sign[5'y(R)] and s' — sign[^7-(R')]. 

(5) For each walker calculate the number of copies positioned at R as 

n(R) = mt{r] + exp[-r(^i(R') + El(R) - 2Et)/2)]}, (42) 
where rj is drawn from a uniform random distribution on the interval [0, 1] . 

(6) Evaluate averages of interest over the ensemble of walkers and update Et as 

EriT + At) = Erir) - Cad, HMact/Ma,,), (43) 

where Mact is the actual walker population, Mave is the desired walker population and 
Cadj is a positive constant, which tunes how quickly the population would reach the desired 
value. The constant is typically chosen to rebalance the population to the desired number 
in a few steps, say, for 10 steps its value is « 10/t. 

(7) Repeat the steps (2) to (6) until the error bars of the calculated quantities reach the desired 
values. 

Note that in the step 4) we correct the short-time Green's function approximation in two ways. 
Besides the acceptance step explained previously, the fixed node approximation is enforced for 
the case when a walker violates the nodal boundary. This occasionally happens, again, due to the 
Green's function inaccuracy. Since the evolution is discretized, the inherent fluctuations in the 
diffusion step can cause the walker to cross the nodal boundary — although for the exact Green's 
function the probability of such an event vanishes. As mentioned above, the time step At is 
usually chosen to be small enough so that the acceptance is 99% or higher. Any remaining time 
step bias can be eliminated by extrapolating At — > from repeated simulations with decreasing 
At. 
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Fig. 2. On the left is a sketch of the fixed-node diffusion Monte Carlo method. The first excited state of ID 
harmonic oscillator serves as a fermionic ground state toy model. Walkers are generated from a constant 
initial distribution and are divided into "plus" and "minus" populations according to the trial function (in 
ID the node is a point). Enforcing the fixed node condition implies that the two populations never mix. The 
plot on the right shows walker diffusion and drift steps close to the node which is depicted by the dashed 
line. 



In order to clarify the fixed-node DMC scaling of the computational time in the system size 
we invoke the previously given formula Eq.(20). We assume a requirement of the fixed error 
bar e for the total energy. The growth of variance cr^ is basically proportional to the number of 
electrons N, since for large systems fluctuations from different electrons will be approximately 
independent, so that cx N. The time Tgampie then includes evaluations of the wave function 
and operators — such as the kinetic energy, etc — and we can write for Tsampie = C2N'^ + c^N^. 
The quadratic component comes from the pair terms in the Jastrow factor and also from filling 
the Slater matrices, while the cubic part is from evaluating the determinant. For systems with 
up to a few thousand electrons C2 >> C3 so that the quadratic term dominates. There is also 
some cost associated with an increase in statistical correlation of samples which we estimate as 
being of the order of TV. The overall scaling then looks like N'^. However, this appears to be a 
rather conservative estimate. For larger systems one can recast the one-particle orbitals into the 
Wannier orbitals which are localized and also impose sufficiently loose cut-off on the range of 
the Jastrow correlations. That makes the Slater matrix sparse [6] and basically eliminates one 
power of N. If we are interested in intensive quantities such as cohesion (energy per particle) that 
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eliminates another factor of N. For some energy differences correlated sampling shows similar 
improvements and so forth. The algorithms and implementation has a significant impact here 
and in practice the fixed-node DMC calculations scale approximately as where 2 < p < 3. 



2.3.4 Beyond The Fixed-Node Approximation 

The fixed-node approximation was has been an important step on the way to overcome the fun- 
damental difficulties with fermion signs. Besides improving the nodes with more accurate trial 
wave functions (see Sec. 4 and 5) or attempts to understand them (see Sec. 3), several ideas how 
to reach beyond the limits of fixed-node approximation were proposed and tested. 

Ceperley and Alder [43] developed the released-node method. As the name suggests, the 
walkers are allowed to sample the regions of opposite sign by using the guiding distribution, 
which is positive everywhere including the nodal regions. The method suffers from the fermion 
sign problem but enables to somewhat better control the loss of the fermion signal by appropriate 
choice of the guiding distribution and other tune-ups. The algorithm produces transient estima- 
tors so that the decay of fermionic excitations has to be faster then the decay of the fermionic 
ground state component. Let us explain the basic idea of the release node approaches. The 
fixed-node condition is eliminated and instead of '^t the importance sampling is carried out 
with another importance distribution \E'g which is sometimes called the guiding wave function. 
UnUke "^t, which is antisymmetric, "ia is symmetric and positive in the whole configurations 
space, allowing thus the walkers to reach any point. The DMC simulation then runs according to 
the importance sampled Eq.(33), where '^t is replaced by '^q- The resulting release node '^rn 
solution is found from f{R, r) = ^r^'^g- The release node solution is transient and can be 
quaUtatively expressed as 

*Kiv « c{t)^f{t) + (1 - c{t))^b{t), (44) 

where c(r) « exp[— — Eb)t\ and indices F and B correspond to fermions and bosons, 
respectively. The energy of the fermionic component can be evaluated from 

^ //(R,r)||i;^(R)dR 
//(R,r)|5dR 

E 'I't(Rto) IT} \ 



where a cx (Ep — Eb)- Each weight Sm carries a sign which is assigned initially as Wm{T = 
0) = sign[5'T(Rm)] and kept constant during the simulation. A simple guiding function can 
be written as = a/^*^ -I- a, where the parameter a controls the average rate of walkers 
passing into the opposite sign region. More sophisticated forms are necessary for inhomogeneous 
systems. The transient samphng efficiency can be measured by the ratio 



s (46) 
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which, ideally, should be close to unity (as is the case for the fixed-node algorithm when = 
I^'tI). As the simulation proceeds, the ratio decreases and the fermionic signal decays while 
the statistical noise grows. Despite this, the method can be useful providing one can reach the 
fermionic ground state estimator before the error bars become too large. In fact, the release 
node method was used for the famous calculations of the homogeneous electron gas correlation 
energies [23] and for small atoms and molecules later [43]. Note that the simplicity of the 
homogeneous electron gas enabled to apply this method to hundreds of electrons already in 
1980. Unfortunately, applications to inhomogeneous systems are much more difficult and have 
progressed at much slower pace. 

Another idea advanced by Kalos and collaborators is based on the cancellation of paired 
walkers with opposite signs [44]. This method has been subsequently improved to work with 
fuUy interacting ensemble of walkers [45]. The method delays the fermionic decay and enhances 
the fermion solution component, however, the current understanding is that the scaling appears 
to be exponential. An important improvement is, however, that the exponent is smaller than the 
difference between the bosonic and fermionic energies and therefore the technique has a potential 
for efficiency gains in some applications. [46]. 

Anderson and Traynor [47] combined some of the ideas in the fixed-node, released-node and 
cancellation methods in an algorithm which employed improvements such as relocation after 
the node crossing, self and multiple cancellations, and maximum use of symmetry. Among the 
systems of interest, the authors calculated the excited state of H2 '^Sj and the barrier height for 
the simplest possible chemical reaction reaction H -h H2 — > H2 -|- H [48]. The accuracy of this 
essentially exact calculation was exceptional with resulting statistical uncertainty of only 0.01 
kcal/mol. 

Among other alternative attempts, let us also mention the method of adaptive nodes (see, e.g., 
Ref. [49]). The key idea is based on representing the approximate nodal function directly by the 
walker ensemble (e.g., by positioning a Gaussian on each walker). This results in the adaptive 
description of the nodes that does not depend upon a priori knowledge of the wave function. The 
main difficulty of the method, however, comes from scaling to larger systems since one basically 
has to populate the whole configurations space with walkers. 

In simulations of quantum problems the fermion sign problem is pervasive. It appears in 
various incamations and determines computational complexity of the underlying problem. It 
has been recently shown [50] explicitly, that QMC simulations of random coupling Ising model 
with fermion sign problem belongs to the class of NP-hard problems, i.e., with non-polynomial 
computational complexity. There are exceptions which enable to eliminate the fermion sign 
problem exactly such as separable Hamiltonians or problems with certain symmetries (the half- 
filled Hubbard model, for example). However, all such known cases are "special" having some 
underlying symmetries which enable to eliminate the fermion signs. It is therefore possible that 
any exact QMC approach with broad applicability belongs to the class of NP-hard problems and 
therefore finding such method might be an elusive and exceedingly challenging, if not impossible 
goal. The next best goal should be therefore based on a more pragmatic view on what does it 
mean to "solve" a given many-body quantum problem. Perhaps the right question to ask can 
be formulated as follows: What is the most optimal scaling of a method which can calculate 
expectations within a prescribed error? (say, energy differences with errors below 0.1 eV, 0.01 
eV and so forth). Here by the error we mean the total error which contains both systematic biases 
from approximations and statistical uncertainties. The answer to this question is a lot easier, at 
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least in a qualitative sense, since already at present the electronic structure QMC methods can 
achieve accuracies of 0. 1-0.2 eV for energy differences with low-order polynomial scaling in the 
system size. Similar progress has been achieved also in simulations in other areas of physics. 

2.4 Trial Variational Wave Functions 

The great advantage of QMC methods is the capability to use explicitly correlated trial wave 
functions. Their quality is the most important factor which influences both the achieved accuracy 
and efficiency. On the other hand, repeated evaluations of (and VvI/t, V^\1/t, etc) is the most 
costly part of QMC calculation and therefore restrict possible alternatives for to forms which 
can be evaluated rapidly and scale with the system size. 

2.4.1 Basic Properties 

^j/r has to be antisymmetric with respect to exchange of both spin and spatial coordinates 

*t(PR, PS) = (-l)^*r(R, S), (47) 

where S = (cti, crjv) are discrete spin variables with values ±i and P is an arbitrary per- 
mutation with the sign equal to (—1)^. Assuming that the Hamiltonian (or other quantities of 
interest) does not involve spin-dependent terms, any expectation value requires only the spatial 
antisymmetry since the sunnmation over spin variables becomes just a permutation of indices 
inside the same spatial integral. That enables to assign the spins to particular electrons and re- 
quires imposing antisymmetry only with respect to spatial interchanges between the electrons of 
the same spin. The spins of all electrons are therefore fixed to the total spin projection equal to 
^{N^ — Ni). The first Nf particles are labeled as spin-up and the rest N — as spin-down. 
Throughout this paper we assume that the spin variables are factored out by this labeling scheme. 
Using an appropriate expansion in determinants the wave function can be made an eigenstate of 
the total spin 5^ and possibly other integrals of motion such as the total angular momentum 
and Lz in atoms. 

We demand that approximates a given bound state so that / / '^'^'H'^t and 

/ \E'^'H^\I/t do exist. The last condition is necessary for having a finite variance. Further, 
and V^'t have to be continuous, wherever the potential is finite. In addition, the continuity of 
the potential implies also the continuity of V^\I/t- Important corollary is that as *t approaches 
an exact solution, the local energy becomes constant everywhere. We can express this also as 
cr^ = (('H — (W))^) Ofor^'r $o> and it is often called as zero variance /property. Note that 
both VMC and DMC estimators have the zero variance property. (This is not automatically true 
for many QMC approaches, in particular, this does not apply whenever there are non-localities 
in the Hamiltonian. For example, many quantum lattice models such as the Hubbard model are 
nonlocal due to hopping terms.) 

We wish that the local energy is close to the eigenvalue and also that the variance as small as 
possible. Therefore we demand that any singularity from the potential is to be canceled by the 
opposite sign singularity in the kinetic energy, what gives rise to Kato cusp conditions [51, 52]. 
As the distance of electron to nucleus vanishes, vn 0, the potential energy divergence cancels 
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out when 



1 



= -Zt 



(48) 



This electron-nucleus cusp condition [Eq. (48)] can be satisfied by a proper choice of orbitals, 
removed by pseudopotentials (see Sec. 2.5) or enforced by Jastrow factor. Of course, the hy- 
drogenic solutions such as, for example. Is and 2p orbitals given as (Ai, (r) = cxp(— Z/r) and 
4'2p{r) = exp(— Z/r/2)YiTO, have the correct cusp, hi addition, it is clear that any linear com- 
bination of orbitals with the correct cusp has the same property. In particular, if all orbitals in a 
Slater determinant have correct cusps then the same is true for the determinant what is easy to 
verify by an expansion in cofactors. 

Similarly, as the distance rij of any two electrons vanishes, there is a divergence in the 
electron-electron Coulomb potential. Let us first assume that electrons i and j have unlike spins. 
In general, the wave function does not vanish at such a collision point. Close to the singularity we 
can write it as an s-wave in rij times a smooth function which depends on the rest of the variables 
R, i.e., limri^^o^(R) = exp{crij)F{'R). Since there are two contributing Laplacians A, -|- Aj 
we get 



1 a*(R) 

*(R) dnj 



(49) 



For two like spins electrons with r^j — >^ the wave function vanishes. The behavior around 
the collision point has a character of a p-wave. Let us transform the coordinates rj and Vj to 



and r+- 



Ti + Tj. The wave fimction can be then written as Zzmr^ -i.o^(R) 



/(r^j) exp(crij )i^(R). By an appropriate coordinate rotation the linear behavior around r^j — > 



can be expressed as /(r^j ) 
modifies to 



[xi — Xj) +0{{xi — Xj)'^). The like spins cusp condition therefore 



1 a* 



,=0 



(50) 



2.4.2 Trial Wave Functions Forms 

It is convenient to express \E't (R) as a product of an antisyimnetric part and an exponential 
of a symmetric correlation fimction Ucorr{^)> often called the Jastrow factor, as given by 

*t(R) = *a X exp[?7co^^(R)]. (51) 

Let us describe the forms for ^^(R), Ucorr(R) and their generalizations as commonly used 
in calculations (and also implemented in our QMC code QWalk [53]). The Jastrow part [36, 54— 
56] explicitly depends on electron-nucleus and electron-electron distances 

Ucorr{{rij}, {ru}, {rji}) = ^ xirri) + uinj) + ^ w{rij, m, rji) (52) 

il i^j i^j-I 

and is written as the sum of one-body (electron-nucleus), two-body (electron-electron) and three- 
body (electron-electron-nucleus) terms. These correlation functions describe several effects. 
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First, the form captures the electron-electron cusp conditions [Eqs. (49) and (50)]. As derived 

above, these are satisfied providing 



du„ 



1/2 (^3) 

i/2 (Ji — —CTj, 



and while all the remaining functions have zero derivatives for rij = 0. Second, the terms which 
depend on electron-ion distances enable to vary the electron density which needs to be adjusted 
once the electron-electron terms are included. Third, the triple electron-electron-ion correlations 
enable to decrease the correlation effects and decrease the local energy fluctuations. Expanded 
in the basis of one dimensional functions the components of the Jastrow factor take form 

X{r)=^crak{r), (54) 

k 

u{r) = fcuspir) + cl%k{r), (55) 

fc 

w{rij,ru,rji) =^c%'ili^[ak{rii)ai{rji) + ak{rji)ai{r^i)]bm{rij). (56) 

klm 

The above Jastrow factor has proved to be very efficient in describing significant portion of corre- 
lation effects with small number of variational parameters. More details about the implemented 
basis functions can be found below or in the QWalk documentation [53J. 

The simplest antisymmetric functional form is the determinant so that one can expand the 
wave function in spin-factorized Slater determinants as 

*i'"*-(R) = Y^dk det[. . . , iflivi), . . .]det[. . . , y^jirj), . . .] (57) 

fe 

where i = 1, . . . , Af^ and j = N-^ + 1, . . . , N). Each spin-up and spin-down Slater determi- 
nant is a function of one-particle orbitals typically found by (post-)HF or DFT methods. In the 
introductory part we have touched upon some of the properties of x^; Slater ^ Although formally 
the Slater determinants provide a complete set of antisymmetric functions, in practice, we are 
limited to use only finite expansions, very often to a single determinant. It is remarkable that the 
single determinant Slater- Jastrow form proved to be very effective for many applications, per- 
haps beyond expectations. It provides surprisingly accurate energy differences such as cohesion 
and binding, excitations, etc, due to partial cancellations of the fixed-node errors. 

On the other hand, multi-determinant expansions which are used to improve the fixed-node 
energies require rather involved and technically difficult processing such as the following tasks: 

(a) Large determinantal expansions often need computationally expensive reoptimizations of 
the determinantal coefficients. At present, with such wave functions we are thus boimd to 
small or intermediate sizes of chemical systems. 

(b) For very high accuracy calculations one-particle orbitals need to be reoptimized. Typi- 
cally, orbitals come from methods which do not incorporate the explicit correlations and 
therefore are not optimal with Jastrow correlation factor included. However, full optimiza- 
tion of orbitals in many-particle systems is still a formidable challenge. What has been 
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successfully tried is to parameterize the effective one-particle Hamiltonian such as hybrid 
DFT functionals and find optimal orbitals in such restricted parameter space. 

The above limitations motivate to think about possibilities beyond Since the determi- 

nant is the simplest antisymmetric form constructed from one-particle orbitals, it is interesting to 
ask what is the simplest antisymmetric object built from two, three or n— particle orbitals ? For 
two particle orbitals such antisymmetric form is called Pfaffian and the resulting Pfaffian pairing 
wave function is written as 

pf,(l, ...,N)= A[4>{h 2), (^(3, 4), . . . , 4>iN - 1, N)] (58) 
= pf [^(1, 2), ^(3, 4), . . . , 0(iV - 1, N)], (59) 

where A is the antisynraietrizing operator and the arguments of the pairing spin-orbital (f){i,j) 
denote both spatial and spin variables. Note that in the most basic form the Pfaffian contains only 
one pair orbital unlike the Slater determinant which requires as many independent orbitals as is 
the number of particles. 

Similarly to multi-determinantal expansions, we can expand the wave function in a linear 
combination of Pfaffians 

= Y^Wm Pf„(. . . , 4>m{i,j), . . .). (60) 
m 

Note that each Pfaffian in the expansion (60) contains different pairing orbital. The form will 
be later simplified to spatial antisymmetries only such as in the case of determinants by em- 
ploying spin-singlet and spin-triplet pairing orbitals. The description of Pfaffians together with 
applications to several systems and possible extensions is discussed in more detail in Sec. 4. 

Another way how to boost the variational freedom of trial functions is based on the so-called 
backflow coordinates. The idea goes back to Feynman and has been employed for description 
of homogeneous systems [55, 57-62]. Feynman suggested that correlations in quantum liquids 
can be approximately captured by replacing the actual coordinates R in *a(R) by quasi-particle 
coordinates X which take into account the influence from the rest of the particles. As a con- 
sequence, the nodes and other properties of ^'^(X) will be, in general, different from those of 
^^(R). Recently, some progress was also reported in applying this idea to chemical (inhomo- 
geneous) systems [63, 64]. In this review we report some of the results we obtained with these 
new developments. The quasi-particle coordinate of the i-th electron at position rj is given as 

Xi = ri+^.(R) 

= r, + ^r(R) + C(R) + (61) 

where we have again divided the contributions to one-body (electron-nucleus), two-body (electron- 
electron) and three-body (electron-electron-nucleus) backflow terms. They can be further ex- 
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pressed as 

C(R) = Ex(^i/)ri/ (62) 
$r(R) = Eu(ri,)ri,- (63) 

^««"(R) = EE[w;i(rij,ri/,rj-7)rij- +u;2(r-ij,n/,rj-7)ri/], (64) 

where Vij = — v.j. x, u and w;i with W2 are similar to one, two and three-body Jastrow terms. 
The calculational and implementation details together with some results are further discussed in 
Sec. 5. 



2.4.3 Basis Functions for the Jastrow Correlation Factor 

The one-particle orbitals and also the correlations has to be represented in an appropriate ba- 
sis functions. There is a vast amount of literature on basis sets for expressing the one-particle 
orbitals. Typically, the localized basis sets are based on Gaussians, while for periodic systems 
plane waves are frequently used. Other options exist such as numerical orbitals, naixed basis, 
etc. We have little to add to this technically involved subject except that QWalk is able to import 
several types of these. We refer the reader to the literature or manuals for the actual packages. 

However, we briefly outline the basis which is used for correlations in the Jastrow factor. The 
exact electron-electron cusp conditions are satisfied by the choice of foUowing cusp function [see 
Fig. (3)] with variable x = r/rcuu where Vcut is some cutoff radius and 7 is the curvature as 

The cusp constant is c = | for electrons with like and c = ^ for electrons with unlike spins. 

Polynomial Pade functions for the same variable x = r/rcut and curvature /3 had proved to 
be a convenient choice for describing the electron-electron and electron-nucleus correlation. In 
calculations, we use the form 

a.(r), hM = fpoiy-Paaeix, P) = ^ ^ _ ^ 3^2) (66) 

The fpoiy-Pade{0) = 1 with derivative fpoiy-Padei^) = ^l^o goes smoothly to zero 
as r ^ Tcut [see Fig. (4)]. These conditions are necessary for preserving the cusp conditions 
already fixed by the cusp functions and by the choice of orbitals or pseudopotentials. Polynomial 
Pade functions are then used as basis functions in expansions for correlation part in the Jastrow 
factor, for example, for the electron-nucleus part we have 

X{r) = ^crak{r), (67) 



where Ck is an expansion coefficient and basis functions Ofc (r ) with different k differ in curvatures 
^fe. Similarly for other components as given by Eq. (67). 
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Fig. 3. Cusp functions with smooth cut-off for two different values of the curvature parameter (7=1 and 
7 = 10) for hke (c = |) and unUke (c = |) spins. 

2.5 Pseudopotentials 

To a high degree of accuracy, most properties of real systems are determined by the states in 
valence electronic sub-shells only. Atomic cores are also expensive in QMC simulations since 
the computational cost increases with the atomic number Z as « (see Ref. [65, 66]) and 
makes the all-electron calculations unnecessarily difficult for heavy atoms. The presence of core 
electrons complicates calculations for several reasons. The short length scales associated with 
atomic cores states require decrease in the QMC time step. Even more difficult problem is the 
increase in fluctuations of the local energy as Z gets higher due strongly varying kinetic and 
potential terms close to the nucleus. This implies also basic efficiency issue since almost all 
computational time would go to sampling energy fluctuations in the core instead of the important 
valence effects. 

In analogy with theories such as Hartree-Fock and DFT, in QMC the atomic core states can 
be eliminated by introducing an pseudopotential (or effective core potential) operator which rep- 
resents the impact of core on valence states. The action of pseudopotential is typically different 
for states with different angular momenta. It is customary to divide the pseudopotential operator 
into local (common to all angular momenta) and nonlocal (different for each angular momentum) 
parts. For a given ion we can therefore eliminate the core states and replace the ionic term as 
follows 

— ^ UP' ^ yP'ir) + (68) 
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where V^" (r) is the local part while 

^nzoc ^ J2 yP'ir) J2 \lm){lm\ (69) 

/ m 

is the nonlocal pseudopotential with projectors over Im states of angular momentum. The non- 
local portion of the local energy i?/,(R) from the electron i in the field of the ion / is given 
by 



S^.rpsf ^2^ + l f , *T(ri, ■ • ■ ,r',, . . . ,rjv) 

, 47r WT(,ri, . . . ,ri, . . . ,rArj 



(70) 



where Pi denotes the Legendre polynomial of degree I. The angular integral above is over the 
surface of the sphere passing through the i-th electron and centered on the ion. The integral is 
evaluated numerically using Gaussian spherical quadrature grids. When the orientation of axes 
of a given quadrature are chosen randomly, the result is an unbiased Monte Carlo estimation of 
the given integral (70). For more details, see original Ref [67]. 

In the DMC method, the application of the nonlocal operator W"^°'^ causes the matrix el- 
ement (R| cxp[— Ariy"'"'^] |R'), and therefore also the propagator, to have a fluctuating sign. 
This would create its own fermion sign problem even for a single electron. Consequently, the 
imaginary time Schrodinger equation for the function /(R, t) ~ 5't(R)^(R, t) is rearranged 




Fig. 4. Polynomial Fade functions with curvatures P ranging from -0.99 to 100. 
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as given by 

_dfi^ = - ^VV(R,r) + V • [vb(R)/(R,t)] + [El(R) - i;T]/(R,T) 

/ ynioc^T(R) T4;oc^(R, r) I 
^\ *t(R) *(R,r) J-^^^'^^' ^'^^ 

where y„/oc denotes the sum of all nonlocal pseudopotential components from all ions. Note 
that, unfortunately, we do not know how to calculate the last term since it contains the solution 
which is a priori unknown. We therefore introduce the localization approximation by neglecting 
the last term in Eq. (71). It has been shown [67] that the error in energy from the localization 
approximation cx — $o)^- However, since the Hamiltonian is modified due to the neglected 
term, the obtained energy is not necessarily an upper bound to the true eigenenergy of the orig- 
inal Hamiltonian. Fortunately, accurate Jastrow-Slater trial wave functions enable to carry out 
calculations without appreciable impact from the localization error. Calculations have shown 
that the localization approximation bias is typically smaller then the fixed-node bias [68, 69]. 
Therefore this approximation proved to be less of a problem in practice since the localization 
error is overshadowed by the fixed-node error. The issue of the upper bound has been resolved 
some years ago once ten Haaf, van Bemmel and co-workers [70, 71] suggested modification of 
Hamiltonians with nonlocal terms which does not violate the variational principle. Let us write 
the matrix element of the nonlocal part as 

(R\Vnioe\n') = (R\V+jn') + (R\V-,JR') (72) 

where {R\V^i^JR') are matrix elements for which (R|1/+^^|R')'I't(R)4't(R') > and vice 
versa for (R|V17„„|R')- For small times we can write 

* ' ft too ' ' 

(R| - ArVniocln') = 5(R - R') - (R|F„/oc|R')At + 0[{ATf] (73) 

so that it is clear that the negative matrix elements do not flip the walker sign and can be sampled 
without introducing any sign problems. The sign flipping (positive) part is evaluated by the 
projection onto the trial function very much like in the localization approximation, i.e., 



= *t(R)-^ j (R|yUR'>*T(R')dR' (74) 
so that the new modified Hamiltonian has a form 

n = Hioc + V+j + j (R|y„7„JR')dR' + F-,,(vI/t(R) = 0) (75) 

where T-Lioc contains all the local parts such as kinetic energies and local interactions including the 
local components of pseudopotentials. The the node term enforces the fixed-node approximation 
explicitly (otherwise V^^j would be unbounded at the nodes). Interestingly, one can explicitly 
show that 

(*T|?i|*T) > (*T|(^ioc + Kioc)|*T) (76) 
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for any '1'^ while the equality takes place when "^t = '^o, i-S-, the energy of the modified 
Hamiltonian is always an upper bound to the original H = Hioc + Vnioc- New algorithmic 
developments by Casula and coworkers [72] enabled to implement this treatment of nonlocal 
operators into the DMC methods as the so-called T— moves. 

Pseudopotentials are also very useful for heavy elements which require inclusion of relativis- 
tic effects. The scalar relativistic corrections can be built-in into the pseudopotentials and for 
very heavy elements this is basically the only choice if one wants to avoid the full relativistic 
treatment. 

Although this section seem somewhat specialized and technical, the results have wider impli- 
cations. Note that the last development enables us to work with any type of nonlocal operators 
using the decomposition from Eq. (72) and the effective upper-bound Hamiltonian from Eq. (75). 
Clearly, this opens possibiUties for simulations with more complicated Hamiltonians, beyond just 
electrons and ions. 



2.6 Optimization of Variational Wave Functions 

The quality of trial wave functions has a significant impact on the efficiency of VMC and DMC 
methods as well as on the accuracy of the final results. The presented functional forms typically 
depend on a number of linear and non-linear parameters which has to be optimized. The opti- 
mization of functions is a classic numerical problem and many approaches can be found in the 
literature. However, in combination with random fluctuations and objective functions defined 
by the finite ensemble of walkers, the problem becomes more comphcated. We will therefore 
outline a few methods used for this purpose. 



2.6.1 Variance Minimization 



Let us denote the set of variational parameters {c} of some real valued trial wave function ^'{c}- 
The mean value of the local energy with respect to is given by 



4tc = rti = (4^^> - Ei'^^- (77) 



The variance of the local energy can be written as 



''{^> - f%ydIL - ^^^^ 



= ((i?r>-^W)2). (78) 



Since the variance directly determines the efficiency one possible approach for wave function 
optimizations is minimize the variance [56] in Eq. (78) using a finite set of fixed configurations, 
where the walkers are distributed according to ^{co}' starting variational parameters is 

denoted as {cq}. The variance optimization is a rather robust strategy since it is always positive 
and bounded from below. In practice, the mean value of the local energy is not a priori known, 
so we replace E^'^^ in Eq. (78) by a reference energy Eref- 

The method can be further improved. One possibility is to use weights which account for the 
wave function change after the parameter updates. The re-weightened variance with the new set 



34 M. Bajdich and L. Mitas 



of parameters {cnew} is then given as 

where we have introduced the weighting function 

^ic„.„>(R) = -^-^. (80) 

The advantage of the reweighting scheme is a more accurate value of variance at each optimiza- 
tion step. However, once the wave function departs significantly from its original distribution 
one has to update the walker positions as well to reflect the updated wave function otherwise the 
method becomes unreliable. 

For minimization algorithms we use two different methods. The first one is a modified ver- 
sion of a quasi-Newton method [73]. It uses only the values and numerical gradients of the objec- 
tive function (in this case the variance) and builds up the inverse Hessian from updates along the 
optimization trajectory. Second algorithm is based on the general Levenberg-Marquardt (LM) 
method. It can be smoothly tuned between Newton and steepest descent approaches with a built- 
in stabihzation parameter. However, it requires both gradient and Hessian and is therefore more 
costly to calculate. The gradient of the variance with respect to i-th parameter is readily obtained 
by differentiating Eq. (78) as 



iE.AE. - E)) + {^El) - {^){El) -2E-{^iE,-E) 



(81) 



where the subscript i denotes Since the variance minimization method can be viewed as a 
fit of the local energy for a fixed set of configurations [56], by ignoring the change of the wave 
function an alternative expression for the same gradient follows as 

a1=2{EL,i{EL-E)). (82) 

The Hessian derived from the gradient (82) is then given as 

al=2{{EL,i-E){EL^^-E)). (83) 

An important property of this approximate Hessian (83) is that it is symmetric in i and j and 
positive definite even for a finite set of configurations what is not automaticaUy true when using 
the formally exact expression. 



2.6.2 Energy Minimization 

In general, a straightforward minimization of the energy in Eq. (77) can be quite unstable. This 
is somewhat counterintuitive and has its origin in the finite sample. The reason is that for a 
sufficiently flexible variational wave function it is possible to seemingly lower the energy for 
the fixed set of configurations, while, in fact, raising the true expectation value. As we have 
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mentioned earlier, the variance is bounded from below and therefore this problem is not present 
even for the finite sample [56]. Although one can frequently re-sample the wave function so that 
the method is always close to the actual minimum one can do better with some modifications. 

We can improve the method convergence by employing the LM method and information 
about gradient and Hessian of the local energy mean. The corresponding gradient can be readily 
obtained from Eq. (77) as 



E, 



= 2 



2E- 



{El - E) 



(84) 



where we used the fact that H is Hermitian. Note, that the expression in the last step of Eq. (84) 
has a favorable property of zero fluctuations as \1/ approaches the true eigenstate. By differenti- 
ating the Eq. (84) we get the Hessian as 



Ei, 



$2 



{El 



Ei + 



E) 



'-E 



(85) 



It is clear that the above Hessian is not symmetric in i and j when approximated by a finite 
sample and the expression leads to quite large noise. Umrigar and Filippi [74] have recently 
demonstrated that the fully symmetric Hessian written entirely in the terms of covariances {{ab) — 
(a) (&)) has much smaller fluctuations then the Hessian from Eq. (85). The modified Hessian from 
Ref . [74] is given by 



Ei. 



+ 



\I/2 



{El-E))- 



\ * 



-E 



+ 



EL,i)- 



{EL,i) 



(86) 



where we have added three additional terms with vanishing expectation value (for proof, see 
e.g. Ref. [75]). The terms make the Hessian symmetric while at the same time decreasing the 
variance. Hence, addition of terms with zero expectations helps to cancel most of the finite 
sample fluctuations and makes the minimization method vastly more efficient. 
Another useful rearrangement of the Hessian (86) is 



Eij =2 



^2 



\ * 



+ 



E 



{Elj 



+ 



{El 



E) 



Er 



{El 



(87) 



If the minimization procedure involves only the linear parameters in the exponential (i.e., of the 
form exp[c(i)/]) such as hnear coefficients in the Jastrow factor, the two terms on the first Une 
of Eq. (87) cancel out, eliminating thus the need for evaluation of the term. For details on 
Levenberg-Marquardt method and other minimization see, for example, Ref. [76]. 
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3 Nodal Properties of Fermionic Wave Functions 
3.1 Introduction 

For a long time, nodes of fermionic wave functions and also related nodes of density matrices 
were considered uninteresting, since they are very complicated and their usefulness or connection 
to physical effects were unclear. For a given real state *(R), the node is defined as the subset 
of R-space for which \I/(R) = 0. The node divides the configuration space into nodal domains 
with constant wave function sign, "+" or Interestingly, the first ideas on nodal properties 
of eigenstates go back to Hilbert and later to Courant. In particular, Courant proved the famous 
theorem that the n— th eigenstate of the 2D Laplacian on simply connected convex 2D regions 
with Dirichlet or Neumann boundary conditions have n or less nodal domains. Later this has been 
further expanded into higher dimensions and more complicated Hamiltonians. In the context of 
physics problems, Breit analyzed the nodes of two-electron states in 1930 and he actually found 
the first exact node for the atomic ^P(2p^) state [77|. More recently, the behavior of nodes 
for highly excited (quasi-classical) states were studied and various bounds on nodal (Hausdorff) 
volumes for n — >^ oo were established. As we have seen, the nodes play an important role in 
eliminating the fermion sign problem and knowledge of exact nodes would enable us to obtain 
fermionic ground states essentially exactly. 

In relation to QMC methods, the nodes were perhaps for the first time encountered and identi- 
fied as a "problem" in the paper by Anderson in 1976 [41]. The fermion nodes of small systems, 
mostly atoms, were investigated in several other papers [12, 40, 78-80]. The fermion nodes for 
degenerate and excited states were further studied by Foulkes and co-workers [81]. Bressanini, 
Reynolds and Ceperley demonstrated differences in the nodal surface topology between Hartree- 
Fock and correlated wave functions for the Be atom and explained thus the large impact of the 
2s, 2p near-degeneracy on the fixed-node DMC energy [82]. More recently, improvements in 
fixed-node DMC energies of small systems were studied by using CI expansions [83, 84], and 
pairing wave functions [1,2, 85, 86]. 

Some of the general properties of fermion nodes were analyzed in an extensive study by 
Ceperley [87], which included proof of the tiling and connectivity properties as well as gener- 
alizations of the notion of nodes for density matrices. From few-electron systems as well as 
from numerical studies of free particle electron gas in 2D and 3D [87] came the conjecture that 
fermionic ground states have only two nodal cells (one "+", one "-"). The paper laid down some 
of the ideas, however, explicit demonstrations were missing even for non-interacting systems. 
Explicit proofs of this conjecture for several important cases was found later [88, 89], when 
one of us used the property of connectivity to show analytically that a number of spin-polarized 
non-interacting and mean-field systems (non-interacting homogeneous electron gas, HF atomic 
states, harmonic fermions and fermions on a sphere surface) have ground state wave functions 
with the minimal number of two nodal cells [88, 89]. Furthermore, using the Bardeen-Cooper- 
Schrieffer wave function, it was shown that for in spin-unpolarized systems (i.e., with both spin 
channels occupied) an arbitrarily weak interaction reduces the four non-interacting nodal cells 
again to the minimal two. Finally, in the same paper was also demonstrated that the minimal 
number of nodal cells property extends to the temperature density matrices. 

From the point of view of QMC methods, the two key issues are the nodal topologies and the 
nodal shapes and their efficient description. The topologies are important since they determine 
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how the configuration space domains are connected, property crucial for the correct sampHng 
process. In addition, the topologies are also related to physical effects such as multiple particle 
exchanges and pairing. The shapes are important for the accuracy and so far our abiUty to de- 
scribe these with sufficient flexibiUty is quite linnited. Some recent advances on this aspect will 
be presented later in the review. 

This section is organized as follows. In Sec. 3.2, we outline some of the general properties 
of fermion nodes. In Sec. 3.3, exact fermion nodes are described for few-electron spin-polarized 
atomic states, including the only known three-electron case discovered very recently. In Sec. 3.4, 
we categorize the nodal surfaces for the several half-filled sub-shells relevant for atomic and 
molecular states. In Sec. 3.5, we show how opposite spin correlations eliminate the redundant 
nodal structure of HF wave functions for cases of spin-unpolarized states. 

3.2 General Properties 

Let us consider a system of N spin-polarized fermions in d dimensions described by a real 
antisymmetric wave function ^(R). We assume that ^'(R) is an eigenstate of a Hamiltonian 
which, for simplicity, contains only local interactions. Consequently, there exists a subset of 
electron configurations called a fermion node, for which the wave function vanishes so that we 
can specify the nodal subset by an implicit equation 

*(R) = 0. (88) 

Assuming that \E'(R) describes the ground state, from the nodal subset we omit regions where 
the wave function might vanish because of other reasons (e.g., external potential, additional sym- 
metries, etc). Note that when we talk about the nodes, we have in mind the nodes of many-body 
wave functions in the space of all particles. These nodes have nothing to do with familiar nodes 
of one-particle orbitals such as the angular nodes of spherical harmonics Yi^ or radial nodes of 
radial one-particle orbitals. In general, the many-particle state node is an {Nd — 1) -dimensional 
hypersurface defined by the implicit Eq. (88). Whenever positions of any two electrons coincide 
(i.e., Tj = Yj), the antisymmetry ensures that the ^(R) = 0. For d > 1, these coincidences 
do not fully specify the nodes, but only define {Nd — rf) -dimensional subspaces of coincidence 
planes. As we will see, the coincidence planes will play an important role for the nodes in ID 
systems. 

Another definition which we will need is the nodal domain. The node divides the configu- 
ration space into subsets in which the wave function sign is constant and which are called nodal 
domains (or cells). The nodal domain definition can be reformulated also in terms of paths in the 
configuration space. A nodal domain fi(Rt) is a subset of configurations which can be reached 
from the point Rj by a continuous path without crossing the node. 

Let us now introduce the basic properties of wave function nodes. 

(a) Manifold almost everywhere — Uhlenbeck [90] proved that the node is generically a man- 
ifold, i.e., it can be locally mapped onto a {dN — 1) -dimensional plane except, possibly, 
for a subset of zero measure. This assumes reasonable Hamiltonians and a few other gen- 
eral conditions. Note that this does not preclude two or more hypersurfaces to cross since 
crossing is automatically lower dimensional, i.e., of zero measure. 
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(b) Nodal crossing angles — If two nodal surfaces cross each other, they are orthogonal at the 
crossing. If n nodal surfaces cross each other, the crossing angles are all equal to n/n. 

(c) Symmetry of the node — The symmetry of the node cannot be lower than the symmetry of 
the state. Typically, these symmetries are the same, however, there are exceptions. There 
are known exact cases with the node synmietry being higher than the state synmietry. 

(d) Tiling property for non-degenerate ground states — The tiling property says that by ap- 
plying all possible particle permutations to an arbitrary nodal cell of a ground state wave 
function one covers the complete configuration space (i.e., n(PRt)-|-n(Rt) = fH). 
Note that this does not specify how many nodal cells are there: the number of nodal do- 
mains can be any even number between two and A^!. We will use this property later. 

Let us analyze the nodes for several models. It is instructive to start with ID systems for 
which the exact ground state nodes can be found explicitly and for arbitrary interaction. The 
reason is that in ID the coincidence and the nodal hypersurfaces are identical due to geomet- 
ric constrains of ID space. Let us first consider TV spin-polarized fermions in a ID harmonic 
oscillator well. The wave function is given by the Slater determinant 

N) = det[0fe(r,)] = A n e""' /'det[l, 2x, Hn-i{x)], (89) 

i 

where Hn{x) is a Hermite polynomial of degree n and A is the normalization. We omit the 
prefactors and transform the Slater matrix to monomials so that the wave function is given by the 
Vandermonde determinant 

*ic(l,...,A^) =det[l,ar,ar^...,ar^-l] = l[{xj - Xi). (90) 

The node is encountered whenever two fermions pass through each other and the wave function 
has N\ nodal cells, since any permutation requires at least one node-crossing. In general, the 
derived node is exact for other ID models including systems with arbitrary local interactions 
(which are not too singular). For periodic boundary conditions one can find a similar expression. 
Let us consider N = {2kF + 1) particles in a ID periodic box (— tt, tt), so that the coordinates are 
dimensionless. In this setting the Fermi momentum becomes an integer, kp = 1, 2.... The one- 
particle occupied states are written as (l>n{x) = e*"^, n = 0, ±1, ikp and the spin-polarized 
ground state is given by 

*iz5(l,...,7V) =dct[0„(xj)], (91) 

where xj is the j-th particle coordinate and j = 1, A^. The term exp{—ikF J2j ^j) 
factorized out of the determinant so that the Slater matrix elements become powers of zj = e'^^ . 
This leads to the Vandermonde determinant and after some rearrangements we find 

..•,N) = e-'^^^o-i ^{zj - Zk) = A JJ sin(a;,fc/2) (92) 

j>k j>k 

where Xjk = Xj — Xk and A is an unimportant constant prefactor. The periodic boundary con- 
ditions result in the same ID nodal hypersurface which is also manifestly periodic, as expected. 
Again, for ID periodic ground states these nodes are exact irregardless of interactions. 
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Properties of nodes for d > 1 are very different and also far more difficult to analyze than in 
one dimension. The key difference is that the node dimensionality is higher than the dimension- 
ality of the coincidence hyperplanes. Clearly, for > 1 particles have enough space to "avoid 
each other" and that has implications on the nodal topologies and the number of nodal cells. In 
addition, this spatial or dimensional flexibility also means that the nodal hypersurfaces can be 
and are affected by interactions. 

We first consider spin-polarized non-interacting fermions in a 2D harmonic well. The one- 
particle states are simply (pnm = CnmHn{x)Hm{y), n,m = 0, 1, ... where Cnm includes the 
Gaussian and normalization which are absorbed into a common prefactor and omitted. The Slater 
matrix elements can be rearranged to monomials as follows 

^2D{l,-,N)=dct[l,x,y,...,x"y"',...]. (93) 

The closed-shell states and the system size are conveniently labeled by M — 1, 2, ... where 
n + m < M, with the corresponding number of fermions given by = (M + 1)(M + 2)/2. 

Let us illustrate the qualitative difference in nodal topologies between higher dimensions 
and ID systems on the following example of three harmonic particles. For M = 1, the three- 
particle wave function is given by ^'2d(1, 2, 3) = det[l, x, y]. In order to understand the nodal 
domain topology, it is convenient to extend the 2D coordinates by a "dunrniy" third dimension 
as Ti = {xi, yi,0). Then 5'2d(1, 2, 3) = zq • (r2i x rai) where Zq is the unit vector in the third 
dimension and = — rj . Note that while in ID three particle state exhibits 3! = 6 nodal cells, 
the result we just derived implies something very different. There are only two nodal domains 
since the set of vectors Zo,r2i,r3i is either left- or right-handed, and the node is encountered 
whenever the three particles become coUinear, i.e., r^i x r^i = 0. Another important property 
which will be explored in detail below, is the fact that it is possible to carry out a triple exchange 
1— >2, 2^-3, 3^1 without node-crossing (e.g., rotate an equilateral triangle). 

The triple exchanges are important for proving the two-nodal cell property in general cases 
for arbitrary number of particles. In order to explain this aspect, we introduce a notion of particles 
connected by triple exchanges. The three particles i, j, k are called connected if there exist a 
cyclic exchange path i^j,j-^k, k^i, which does not cross the node. Let us now suppose 
that there exist a point R* inside a nodal domain such that all particles are connected into a single 
cluster of triple exchanges (see Fig. 5). Then ^'(R) has only two nodal cells. In other words, the 
whole configuration space is covered by only one positive and one negative nodal cell (i.e., the 
nodal cells are maximal). Let us try to understand why this is the case. First, any triple exchange 
is equal to two pair exchanges. The existence of the point Rj with all particles connected then 
means that all positive (even) permutations P+Rf can be carried out within the same nodal cell. 
Second, the tiling property implies that once all particles are cormected for R^ the same is true 
for entire cell rJ(Rt). Therefore, there will be only one maximal cell per each sign. More details 
on this construction can be found in Ref. [87]. 

We are now ready to show that six particles (M = 2) in the harmonic 2D well also have the 
ground state with the minimal nodal count of two. We position particles 1, 2 and 3 so that they 
have common y— coordinate, t/o- For M = 2 the wave function reads 



*2d(1,-,6) = d.et[l,x,y,x^,xy,y'^] = 



40 



M. Bajdich and L. Mitas 



All-particle 
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Fig. 5. Sketch of a connected cluster of triple exchanges. Each triangle means that there is a triple exchange 
path which does not cross the node. 
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Fig. 6. Positions of particles for proving the connectivity of six 2D harmonic fermions. 
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where the particle configuration which will enable to figure out the connectivity is given in Fig. 6. 
All elements containing yo can be eliminated by appropriate rearrangements and the determinant 
factorizes as 

6 

*2c(l, 6) = Hivi - yo)*ii3(l, 2, 3)*2i3(4, 5, 6). (95) 

i=4 

It is now clear that we can rotate the triangle of particles 4, 5, 6 without crossing the node since 
^2£)(4, 5, 6) remains constant and the wave function does not change the sign. That implies 
that particles 4, 5, 6 are connected. However, the wave function is rotationally invariant so the 
particles 1, 2, 4 must be connected, as well as the particles 2, 3, 5. Therefore all six particles are 
connected and the wave function has only two nodal cells. 

With some effort it is possible to generalize these types of constructions and show that har- 
monic particles are connected for any closed-shell state with arbitrary M — > oo and for arbitrary 
d as given elsewhere [88]. 

It seems that the harmonic oscillator might be somewhat special due to the simple structure 
of the wave function, since the nodes are zeros of antisymmetric polynomials [89]. In fact, the 
minimal two-nodal cells property appears to be generic. Let us present an example which is 
more realistic and does not reduce to simple polynomials. Consider an atomic spin-polarized 
state ^S{ls2s'iip'^), i.e., 5 electrons in the Coulomb —Z/r potential for both non-interacting and 
HF wave-functions. The wave function can be written as 



^at{l,-,^)=dei[pUr),p*^^{r),x,y,z\, (96) 

where p\^{r) = Pu{r)/p2p{r) and p^^ir) = P2s{r) / P2p{r) since non-negative p2p{r) is fac- 
torized out. The coordinates become dimensionless by rescaling with the atomic number Z as 
r ^ Zr. Let us position particle 1 at the origin and particles 2 to 5 on the surface of a sphere with 
the radius rjo equal to the radial node of P2s{'r) orbital, i.e., p2s{vo) = 0. For such configurations 
we obtain 



*at(l, 5) = P*isim)P*2s{0)^32 ■ (r42 X Y52) (97) 

Three-particle exchanges 2, 3, 4, 5 easily avoid node-crossings by appropriate positioning and 
rotations on the sphere, e.g., using positions of the tetrahedron vertices. We can also show that 
particle 1 is connected by the exchange 123 — > 312 which is parameterized as ri (t) — 779 [t , 0, 0] ; 
Y2{t) = ?7o[c(i), s(f), 0] and Y^it) = 7?o[0, 1 — t,Q\ where t = Q {t — \) corresponds to the 
beginning (end) point of the exchange path while c{t) = cos(7rf/2) and s{t) = sin(7rt/2). 
Setting r4 = [0, 0, r/o] and rs = [0, 0, — ryo] we find that ^at is proportional to 

P*2siino)c{t){l -t)+ pU{1 - t)r)Mt)t > (98) 

The inequality holds for the whole path < t < 1 since P2s{t'no) > for < f < 1 for 
both non-interacting and HF cases. Note that the demonstration is non-trivial since the HF wave 
function reflects the mean-field interactions and the orbitals can be found only by a numerical 
solution. Nevertheless, qualitative features of the HF and non-interacting orbitals (symmetries, 
structure of radial nodes, etc) are basically identical and enable to verify the nodal cell count for 
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both cases. Note also that the wave functions is not a multi-variate polynomial like for the case 
of harmonic potentials. The proof can be further extended to more shells with more electrons 
such as Is2s2p^3s3p^ and Is2s2p^3s3p^3d^ [89]. 

Similar two-nodal cell demonstrations can be carried out for many paradigmatic quantum 
models such as 2D and 3D homogeneous fermion gas, fermions on a sphere, etc [89]. In addition, 
the same can be shown also for the temperature density matrices, for which one can define the 
nodes in similar way as for the wave fimctions. 



We assume the usual electron-ion Hamiltonian and we first investigate a few-electron ions fo- 
cusing on fermion nodes for sub-shells of one-particle states with ,s,p, rf, ... symmetries using 
variable transformations, symmetry operations and explicit expressions for the nodes. 

3.3.1 Two-Electron Atomic ^P(p^), ^S{ls2s) and Molecular ^^^(Tr^) IViplet States 

Apparently, the exact node of the atomic ^P{p'^) state was derived in a different context by Breit 
in 1930 [77, 83, 91]. Here we offer an independent proof which enables us to apply the analysis 
to molecular states with the same symmetries. The state is the lowest within its symmetry class. 
It has even parity and cylindric symmetry, say, around z-axis. It is also odd under rotation by tt 
around any axis orthogonal to the cylindric axis. We denote the rotation by tt around, say, the 
x— axis, as R{'kx). In order to uncover the exact node it is convenient to define new coordinates. 
Let us denote r^2 = ri+r2,rj^2 = | 1' together with the customary r 12 = ri — r2,ri2 = |ri2|. 
We can now introduce the following coordinate map 



where cosuj = zq ■ (ri x r2)/|ri x r2| with zq being the unit vector along the z-axis, cos /? = 
''12 ■ ''i2/(?' 12^12)' ™d (fi' being the azimuthal angle of ri x r2. Further, (f is the azimuthal angle 
of in the relative coordinate system with the x-axis unit vector given by a projection of zq into 
the plane defined by ri, r2, i.e., e^; = zop/|zop|, = (ri x r2)/|ri x r2| and Gj, = x Gj;. 
The coordinate ip' can be omitted from further considerations right away due to the cyhndric 
symmetry so that the wave-function dependence simplifies to ^ {r f 2, r 12, cos ^, cos uj,(f). We 
introduce an operator Q which acts on coordinates and is given hy Q = PjPi2R(nx) where Pj 
inverts around the origin and P12 is the pair exchange of particles I and 2. Considering the state 
symmetries, action of Q reveals that the wave fimction is invariant in the simultaneous change 
(cos 13, if) — >■ (— cos P,—ip). Combining action of Q and P12 we find that 



with the rest of variables unchanged. The node is therefore given by cos oj — 0, i.e., Zq • (ri •r2) = 
0, and it is encountered when both electrons happen to be on a plane which contains the z- 
axis. This exact node therefore agrees with the node of non-interacting or HF wave function 
^ = det[p{r)x, p{r)y]. 

The derived node equation is applicable to any cylindric potential with Dcoh symmetry, e.g., 
equidistant homo-nuclear dimer, trimer, etc, with one-particle orbitals tTx, ny which couple into 
the triplet state ^Eg(7rx%). 



3.3 Exact Nodal Surfaces 



(ri,r2) (rj'2,ri2,cosa;,cos/3,</9,<^')> 



(99) 



^(..., — cosa;, ...) = —'!'(..., cos w, ...) 



(100) 
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Note that the parametrization given above leads to the exact node for the lowest two-electron 
triplet state, "^5(1525) which has been known for some time [12, 78]. The spherical symme- 
try makes angles cj and (/? irrelevant and simplifies the wave function dependence to distances 
ri,r2,ri2 or, alternatively, to ri2 , r J*^ , cos l3. Applying P12 leads to 

-*(ri2,r^2.cos^) = *(ri2,r^2>-cos^), (101) 

so that the node is given by the condition cos/3 = 0, i.e., ri — r2 = 0, or, equivalently, rf = 
r^. The last equation clearly shows that the node is generically a quadratic manifold, namely, 
a 5D hyperboloid in 6D space (one recalls that in 3D space the 2D hyperboloid is given as 
± const = x"^ — y^). Again, the nodal domain count is two and the domains correspond to 

ri > r2 or ri < r2. 

For the presented states the exact nodes are determined solely by symmetry and are indepen- 
dent of interactions. That further implies that all the excited states with the same symmetries will 
contain the ground state node although they will exhibit also additional nodal hypersurfaces. We 
will come back to this point in the following subsection. 

3.3.2 Three-Electron Quartet ^5(2^3) State 

Recently, we have found another case of exact node for a three-electron atomic state, namely, the 
lowest spin quartet of S symmetry and odd parity, ^S{2p'^). It is actually very easy to find the 
node by transforming it to the previously analyzed cylindric symmetry. We can always rotate the 
system so that one of the electrons, say, the electron 3, is positioned on the z— axis so that the 
remaining symmetry is only cylindric. However, then the wave function for the remaining two 
electrons is identical to the previous ^P{2p^) state and we can directly write the nodal condition 
as r3 • (ri x = 0. The wave function vanishes when all three electrons appear on a plane 
which contains the origin. 

One can demonstrate the same by defining an alternative coordinate map cos a = rs • (ri x 
i'2)A3|''i X r2|, cos/3 = [(ri x r2) x r^2] •ri2/[|(ri x ^2) x ri"2||r^2l] and 7 being the azimuthal 
angle of in the relative coordinate system given by the following unit vectors, = i" 12/^12' 

= [[(ri X r2) x ri2] x rj'2]/|[(ri x r2) x ri2] x rj'jl and e^, = x e^:. Besides these relative 
coordinates, the three Euler angles fix the orientation in the original coordinate system, however, 
the S symmetry makes them irrelevant. In the new coordinates the node comes out by the action 
of P\2 on 4'(rj^2i '''12, rs, cos a, cos /3, 7) as 

— cos a, cos /3, 7) ~ — cos a, cos ^, 7). (102) 

It is clear that P2?, and P13 would lead to the same nodal condition, assuming one would redefine 
the relative coordinate map accordingly. The node of non-interacting and HF wave function 
"^HF ~ dot[/9(r)a;, p(r)y, p(r)z] again agrees with the exact condition. As mentioned before, 
the combination of symmetries is so restrictive that interactions leave the node invariant and this 
is in fact the case also for the excited states. Therefore all the excited states must contain the 
ground state fermionic node. This is illustrated in Fig. 7 which shows the groimd state (planar) 
node together with additional nodes, as expected for excitations. 

The excitations have therefore special symmetry structure which enables to show that the 
one-particle s channel is absent from the whole spectrum in this synmietry sector. One can 
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Fig. 7. The 3D projected nodes of a few selected excitations for the symmetry adopted CI expansion of 
the *5(p'^) ground state. The exact planar node of the quartet ground state is also possessed by all the 
excitations. The small spheres show the fixed positions of two electrons while the third one is scanning the 
nodal surface. Labels indicate the types of excitations. 

demonstrate this as follows. We expand any state of this symmetry in Unear combinations of 
excited determinants 



where the sum is over the complete one-particle spectrum (including scattering states). Due to 
the selection rules coefficients Cijk vanish for any excitation which contains one or more I = 
orbitals. Consider first states with = 0. That implies that all determinants in this configuration 
must have Ij = If^ otherwise the Clebsch-Gordan coefficients for S symmetry vanish. However, 
then the excitation would be of even parity and therefore such configurations drop off. The only 
remaining possibility is that li = Ij = 0. Such configurations then require that also 1^ — 0, 
however, such states with odd parity do not exist. Therefore the one-particle channel with s 
symmetry is "switched-off" completely. This property was useful for showing that the fixed-node 
calculation of this state provided the exact energy even when using nonlocal pseudopotential with 
s non-locality [92]. 

In order to illustrate how exact and approximate nodal surfaces affect the fixed-node DMC 
results we studied the valence-electron states of the spin-polarized nitrogen cation, some of which 
were analyzed above. The core electrons were eliminated by pseudopotentials [93]. The trial 
wave function was of the commonly used form with single HF determinant times the Jastrow 
correlation factor [5]. As we explained above, the pseudopotential nonlocal s— channel does 
not couple to either the odd parity S state or the even parity P{p^) state so that the nonlocal 
contribution to these states vanishes exactly. 

In order to compare the fixed-node QMC calculations with an independent method we have 
carried out also configuration interaction calculations with ccpV6Z basis [94] (with up to three 
g basis functions), which generates more than 100 virtual orbitals in total. In the CI method 
the wave function is expanded in excited determinants and we have included all single, double 
and triple excitations. Since the doubles and triples include two- and three-particle correlations 




(103) 



i,3,k 
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Tab. 1. Total energies (in Hartrees) of N"*", N"*"^ and N"*"^ ions with core electrons eliminated by pseu- 
dopotentials [93]. The energies are calculated by HF, configuration interaction (CI), variational (VMC) and 
fixed-node diffusion Monte Carlo (DMC) methods. 



State 


HF 


CI 


VMC 


DMC 




-5.58528 


-5.59491 


-5.59491(2) 


-5.59496(3) 




-7.24716 


-7.27566 


-7.27577(1) 


-7.27583(2) 




-8.98570 


-9.02027 


-9.01819(4) 


-9.01962(5) 



exactly, the accuracy of the CI results is limited only by the size of the basis set. By comparison 
with other two- and three-electron CI calculations we estimate that the order of magnitude of the 
basis set CI bias is Q.Ol mH for two electrons and « 0. 1 mH. and for three electrons (despite 
the large number of virtuals the CI expansion converges relatively slowly [95] in the maximum 
angular momentum of the basis functions, in our case Imax = 4). The pseudopotentials we used 
were identical in both QMC and CI calculations. 

The first two rows of Tab. 1 show the total energies of variational and fixed-node DMC calcu- 
lations with the trial wave functions with HF nodes together with results from the CI calculations. 
For ^P{p'^) the energies agree within a few hundredths of mH with the CI energy being slightly 
higher but within two standard deviations from the fixed-node QMC result. For ^S{p^) the CI 
energy is clearly above the fixed-node DMC by about 0.17 mH which is the effect of the finite 
basis. In order to illustrate the fixed-node bias in the case when the HF node is not exact we have 
also included calculations for four electron state ^S{sp^) (for further discussion of this Hartree- 
Fock node see Sec 3.4.2 below). For this case, we estimate that the CI energy is above the exact 
value by « 0.3 mH so that the fixed-node energy is significantly higher than both the exact and 
CI energies. Using these results we estimate that the fixed-node error is « 1 mH, i.e., almost 3% 
of the correlation energy. 



3.4 Approximate Hartree-Fock Nodes 

It is quite instructive to investigate the nodes of half-fiUed sub-sheUs of one-particle states with 
higher angular momentum. We will explore the nodal hypersurface projections into the 3D space 
to understand what an individual electron "sees" during the imaginary time evolution. The study 
also helps to classify nodes for various cases of atomic and molecular systems. For the next few 
subsections we will closely follow the derivations from our paper [92]. 

3.4.1 Hartree-Fock Node of ^S{d^) State 
The HF determinant wave function for ^S{d^) is given by 

5 

^HF = JJp(ri)det[222 _ _ y2^^2 _ y"^ ^xz,yz,xy], (104) 

i=l 
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Fig. 8. The 3D projected Hartree-Fock node of ^S{d^) state, which is an elliptic cone (left and right 
pictures). The picture in the center shows a configuration of two pairs of electrons lying on two orthogonal 
planes which contain the origin. This particular node is of lower dimension because of the additional 
constraint on positions of the electrons. It appears as a crossover between the elliptic cones with different 
orientation (left and right pictures). The small spheres show the positions of the four electrons while the 
line denotes the axis. 



where p{ri) is the radial part of the Sd-orbital. Since the radial prefactor is irrelevant it can be 
omitted. The S symmetry allows to rotate the system so that, say, electron 1 is on the z-axis, 
and then the corresponding column in the Slater matrix becomes (2z^, 0, 0, 0, 0). Assuming that 
zi 7^ we can then write the nodal condition as 

det[x'^ — y'^ , xz,yz, xy] = 0. (105) 

Using one of the electrons as a probe (i.e., looking at the node from the perspective of one of the 
electrons) we can find the projection of the node to the 3D space. We denote the probe electron 
coordinates simply as (x, y, z) and expand the determinant so that we get 

{x^ — y'^)'mi + xzm2 + yzm^ + xym^ — (106) 

where are the corresponding cofactors. We divide out mi assuming that it is non-zero (not a 
crucial assumption as clarified below). We get 

(a;^ — y^) + axz + byz + cxy — 0, (107) 

where a = 1712/1711, b = m^/mi, c — mi/ mi. By completing the square this can be further 
rearranged to 

{x - kiy){x - k2y) + z{ax + by) = Q (108) 

with fci.2 = (— c ± Vc^ + 4)/2. Let us define rotated and rescaled coordinates 

u* = -{ak2-b){x-kiy)/{ki-k2) (109) 
V* ^{aki~b)[x-k2y)/{ki-k2) (110) 
w* = z[(afci - b){ak2 - b)]/{ki - fca)^ (111) 
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Fig. 9. The 3D projection of the nitrogen cation ^S{sp'^) Hartree-Fock node (the core electrons are 
eliminated by pseudopotentials). The projected node exhibits two topologies. It is either a planar surface 
deformed by the radial orbital functions at the nucleus or, in certain configurations, the deformation forms 
a small bubble detached form the surface (the picture on the right). The small cross is the location of the 
ion while the small spheres denote positions of electrons. 



so we can write the Eq. (107) as 



Note that this equation has a form which is identical to Eq. (106) with mi = so this represen- 
tation is correct for any mi. After some effort one finds that Eq. (112) is a cone equation (i.e., 
orbital) as can be easily verified by using the following identity 



where u = u* +v* + 2w* , v — {—u*+v*+2w*), w — {u* —v* + 2w*). The 3D projected node 
is therefore an elliptic cone. Note that its equations is the same as rotated and rescaled angular 
part of the (1^2 orbital, — x'^ — y^. 

Therefore the 3D projection of the HP node is a family of cones given by the homogeneous 
second-order polynomial in three variables with coefficients determined by the positions of the 
four electrons. From the theory of quadrics [96], one finds that a general elliptic cone can 
possibly "fit" up to five 3D points/electrons. However, in our case there is an additional constraint 
of one electron being fixed on the z-axis. That implies that the z-axis Ues on the cone so that the 
cone always cuts the xy plane in two lines, which are orthogonal to each other The orthogonality 
can be verified by imposing z = in Eq. (108) and checking that kik2 — —1. In addition, one 
can find "degenerate" configurations with two pairs of two electrons lying on orthogonal planes 
(Fig. 8). This corresponds to the "opening" of the cone with one of the elliptic radii becoming 
infinite and the resulting node having a form of two orthogonal planes (Fig. 8). The condition 
is equivalent to A44 — iP' ~ a? ~ abc — 0, where A44 is one of the quadratic invariants [96]. 
There are more special cases of nodes with additional restrictions: (a) when two electrons lie on 
a straight line going through the origin; (b) when three electrons lie on a plane going through the 
origin; (c) when four electrons lie on a single plane. 



u*v* + w*u* + w*v* — 0. 



(112) 



{2u^ - - w^)/8 



— u*v* + w*u* + w*v* , 



(113) 
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3.4.2 Hartree-Fock Nodes of the ^S{sp^) Ion 

The HF node for this spin-polarized state with Is^ (He-core) states replaced by pseudopotentials 
can be studied in a similar way as studied previously [92]. We can expand the determinant 
as in the previous cases with a new feature that the radial parts will modify the shapes of the 
hypersurfaces. The system therefore corresponds, e.g., to the N+ ion from Tab. 1. Note that 
due to pseudized core the one-particle s orbital has no radial node although it corresponds to 
the physical 2s state. By expanding the determinant in the colunon of the probe electron with 
position X, y, z the 3D node projection is simply given by 

x + b'y + c'z + d'r]{r) = 0, (1 14) 

where b' ,c' , d! depend on ratios of cofactors and ri{r) = ps (r) / Pp{r) is the ratio of radial parts of 
s and p orbitals. The probe electron will see a plane with a approximately bell-shape deformation 
in the area of the nucleus (see Fig. 9). The shape of deformation depends on the ratio of s and 
p radial parts and the magnitudes and signs of the cofactors. For certain configurations the 
deformation in a detached, separate ellipsoid-like bubble. The bubble is caused by the radial 
dependence of 77 (r) which for pseudized core is not a monotonic function and therefore can 
create new topologies. Note that despite the fact that the 3D projection shows a separated region 
of space (the bubble) the complete node which is 11 -dimensional has again the minimal number 
of two nodal cells. This can be checked by placing the four particles at the vertices of a regular 
tetrahedron with the center at the ionic center The wave function does not vanish for such 
configuration as can be easily verified. Rotations by 7r/3 around of each of the four three-fold 
symmetry axes of the T4 group then shows that all the particles are connected. 

3.4.3 Hartree-Fock Nodes of Spin-Polarized p^d^ and sp^d^ Shells with S Symmetry 

Let us for a moment assume a model wave function in which the radial parts of s,p, d orbitals 
are identical. Then, using the arrangements similar to d^ case, we can expand the determinant of 
p'^d^ in one column and for the 3D node projection we then get 

2u'^ - v"^ - w"^ + au + I3v + 'yw = 0, (115) 

where u, v, w are appropriate hnear combinations of x, y, z. This can be further rewritten as 

2{u + a/4)2 -{v- /3/2)2 - {w - j/2f + So = 0, (1 16) 

where i5o = + + 7^)/4. It is clear that the quadratic surface is offset from the origin 

(nucleus) by a vector normal to au + f3v + jw = plane. Using the properties of quadratic 
surfaces one finds that for [c)? j(c? +/3^+7^)) < 2/3 the node is a single-sheet hyperboloid 
with the radius \fSQ\ otherwise it has a shape of a double-sheet hyperboloid. The double-sheet 
hyperboloid forms when there is an electron located close to the origin. A special case is a cone 
which corresponds to ((5o = 0). The case of sp^dc' is similar, but with different ^o> which now 
has a contribution from the s-orbital (see Fig. 10). Once we include also the correct radial parts 
of orbitals in the s,p, d channels the coefficients of the quadratic form depend on both cofactors 
and orbital radial functions. The resulting nodal surface is deformed beyond an ideal quadric and 
shows some more complicated structure around the nucleus (see Fig. 1 1) as illustrated on HF 
nodes of the majority spin electrons in Mn+^ ion (note that the Ne-core electrons were eliminated 
by pseudopotentials). 
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Fig. 10. The 3D projection of the angular part of the ^'^ S{sp'^cf') state Hartree-Fock node (with radial parts 
of orbitals identical for all spd orbitals). The projection has a topology of a single-sheet or double-sheet 
hyperboloid. The small cross shows the location of the nucleus while the spheres illustrate the electron 
positions. 




Fig. 11. Projected Hartree-Fock node of S{sp'^d^) of the majority spin valence electrons in Mn"^^ ion. 
The Ne-core electrons are eliminated by pseudopotentials. Note the deformations from the radial parts of 
orbitals, including a small bubble detached from the rest of the surface (the right picture). The green cross 
indicates the nucleus and the red crosses the electron positions. 
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3.5 Changes in Nodal Topologies From Interactions and Spin Correlations 

We conjecture that a non-degenerate ground state of any given symmetry possesses only two 
maximal nodal cells. It was demonstrated for the considered fully spin-polarized systems that 
the corresponding HF wave functions have the desired topology, i.e., two maximal nodal cells. 
On the other hand, for partially spin-polarized and unpolarized systems the corresponding HF 

wave functions of the form 



lead to at least four nodal cells due to the product of two determinants each with two cells. As it 
turned out, this is correct only for special cases such as non-interacting systems with occupations 
in both spin channels and some separable models. In general, such structure is an artefact. It 
appears since two different spins and corresponding antisymmetry requirement on configuration 
subspaces is essentially a special non-uniform symmetry requirement on the wave fiinction. We 
will see below that requirements of additional or special symmetries could lead to the increase of 
nodal counts even in the spin-polarized cases. Interestingly, generic interactions have the effect 
of smoothing out the artificial and redundant nodal structures so that the minimal two nodal cell 
property is restored. This is obvious since it is not clear right away that interactions would drive 
the nodal topology changes, in particular, the changes of this type. We will illustrate this on a 
few examples. 

Let us first analyze the simplest spin-polarized case of three electrons in a central, radi- 
ally symmetric potential. Consider the lowest atomic quartet of S symmetry and even parity is 
^S{ls2s3s). In the non-interacting linnit this state exhibits six nodal cells. The reason is that 
all the single-particle orbitals depend only on radii and therefore the nodes have effectively one- 
dimensional character. We will sketch how interactions change the topology and the number of 
nodal cells decreases to minimal two. In the recent paper [88] we showed this numerically for 
the Coulomb potential. For simplicity, we assume a simpler model of harmonically confined 
electrons while using the same symmetries of states and one-particle orbitals as for the Coulomb 
case. The harmonic oscillator has the advantage that the exponential Gaussian prefactor is the 
same for all states and one-particle orbitals irrespective of the occupied sub-shells. As before, we 
omit the prefactors in all expressions since they are irrelevant for the nodes. The non-interacting 
(or HF) wave function then becomes simply 



since (p2s and are first and second order polynomials in r^, respectively. It is clear that there 
are six nodal domains, because ^hf vanishes whenever rj = rj and there are six permutations 
of ri, r2, rs. Now, we can switch-on the interactions and describe the resulting correlations by 
expanding the wave function in excitations. The excitations into s-orbitals with higher quantum 
numbers do not change the nodal structure and their contributions to the exact wave function 
are actually very small. The dominant excited configuration is ^S{ls,2p, 3p), which captures 
angular correlations in the outer shells pair. We write the correlated, two-configuration wave 
function as 




(117) 



*/fF(ri,r2,r3) = (r| 



(118) 



* = det[(t)u,<f>2s,<t>3s] 

+ ^{det[(f>is, (/)2px, hpx] + {x^y) + {x^ z)} 



(119) 
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where /3 is the expansion coefficient. (This effect can be equivalently formulated also as a Pfaffian 
with triplet pairing). Let us now try to carry out the triple exchange. For this purpose we position 
particles as follows: ri = [0,ra,0], r2 = [0,0,0] and = [rb,0, 0] where Va > H > 0. 
During the exchange the particles first move as follows: 1 — > 2, 2 — > 3 and 3^- [r„ . 0,0]. 
The exchange is then finished by rotating particle 3 by +n/2 around the z— axis to the position 
[0, Ta, 0], completing thus the repositioning 3^1. The particle 1 moves to the origin along the 
y— axis while particles 2, 3 move along the a;— axis while their radii are such that > is 
maintained during the exchange. Clearly, during the exchange the node of non-interacting wave 
function will be crossed twice: when n = and again when n = r^. We will now show 
that for the correlated wave function the exchange can be carried out without crossing the node. 
Using the expressions for the states of harmonic oscillator (f)2px = x and 03^3, = x{r^ — 1) and 
substituting the coordinates as outlined above, one finds that the wave fimction becomes 

^ = (4- rMr^ - rDirl - rf) + ^x^x^] (120) 

We assume that (3 > 0. (In the case /3 < 0, it is straightforward to modify the positions and 
exchange path accordingly: the initial position of the particle 3 should be [—H, 0, 0], it moves to 
[—Ta, 0, 0] and it is then rotated by — 7r/2 around the z-axis.) Typical value of (3 is of the order 
of 10~^ which reflects the fact that the same spins (triplet) correlations are usually very small. 
After a Uttle bit of analysis one finds that this function does not change the sign provided the 
difference between the radii r2 and rs when ri = (r2 + i'3)/2 is sufficiently small. Then the 
following inequality holds 

\r2-r3 + 0[{r3-r2f]\<p, (121) 

which is easy to guarantee with some care when moving the particles along the exchange path. 
The minimum of the radial part appears essentially at the point ri = {r2 + r^) /2 and the value 
is negative. However, the other term keeps the wave function positive providing the negative 
term is not very large. Since the term \r2 — ^3 + 0[(r3 — r2)^] | can be made arbitrarily small by 
adjusting radii r2 , to be sufficiently close to each other one can easily fulfill the inequality. The 
wave function then remains positive along the whole exchange path. The nodes of uncorrelated 
and correlated wave function are plotted in Fig. 12. 

Similar topological change occurs in spin-unpolarized systems. The simplest possible case is 
the Be atom ground state with the electronic state ^S'[l,s^2,s^|. The wave-function can be written 
as a sum of HF and correlation components, i.e., ^' = "S^hf + ci^corrt where a is an expansion 
coefficient. The HF part is given as 

= det^[0i„ </)2,]det^[(/.i„ 02.], (122) 

while the dominant correlating configuration is the 2s^ 2p'^ excitation 

^corr = det^[(l)u, ^2px]<iet^[^is, (t>2px] + {x ^ y) + {x ^ z). (123) 

Clearly, the wave function has the required S symmetry and it is straightforward to show that 
the wave function has only two nodal cells. Consider a plane passing through the origin and 
configure the two spin-up electrons at antipodal points on a circle with radius Va > 0. Place the 
spin-down electrons on the same plane at antipodal points of a circle with different radius, say. 
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Fig. 12. The HF node and correlated node for the ^5'(ls2s3s) central potential state. Note the change in 
topology after the inclusion of the correlation through the dominant additional configuration. The figure on 
the left plots the complete HF node since it depends only on the three radii. The figure on the right shows 
the node only on 3D subspace, there are more nodal "openings" in other dimensions. 



r-f, > and rf, 7^ r^. In this configuration the HF wave function vanishes since particles lie on 
the Hartree-Fock node. In general, however, the correlated wave function does not vanish since 
we have 

* = Cr2i-r43, (124) 

where C is a symmetric non-zero factor. We can now rotate all four particles in the common 
plane by tt and the wave function remains constant. The rotation exchanges the two spin-up 
electrons and the two spin-down electrons simultaneously. This shows that the spin-up and - 
down nodal domains are connected otherwise we would have encountered zeros. Note that doing 
the rotation in one of the spin-channels only, i.e., the two-particle exchange for that spin, would 
necessarily result in hitting the node. The correlation therefore enables the four-particle exchange 
to avoid the node and the example provides an interesting insight into the many-particle concerted 
exchange effects. Similar behavior can be found in homogeneous periodic gas and again can be 
generalized to arbitrary sizes [89]. In the periodic systems one can show that these effects enable 
to wind singlet pairs around the periodic cell without encountering the node providing the path 
is chosen appropriately and also that similar effects is impossible for HF wave functions [89]. 

Multi-particle exchanges are closely related to quantum condensates such as Bose-Einstein 
condensation in bosonic systems or superconductivity in fermionic systems. The interpretation 
of condensates as systems which exhibit macroscopic chains of quantum exchanges goes back 
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to Feynman. In bosonic systems this does not help much and even looks "trivial" since there are 
no nodes to begin with and the emergence of condensates is driven by bosonic correlations. In 
fermionic systems the correct nodal topologies also appear only as a necessary but not sufficient 
condition for the condensation to appear. However, from this point of view it is important that the 
Bardeen-Cooper-Schrieffer wave functions, which will be described later, do exhibit the correct 
nodal topologies as shown elsewhere [88, 89]. 




Fig. 13. The 3D projected nodes for N2 dimer. The HF nodes (left) are bended and distorted due to spin 
correlations into CI nodal surface (right) with just two nodal cells. Sampling of nodes is performed with a 
pair of electrons with opposite spins, which are close to each other; positions of other electrons (crosses) 
are fixed; small spheres indicate ions. 



3.5.1 HF and CI Nodes for N2 Dimer State ^ S+ 

The projected nodal structure of the spin-unpolarized ground state for N2 dimer is shown 
in Fig. 13. The He-core electrons are eliminated by pseudopotentials as in previous calcula- 
tions [93]. The HF wave function with the separation of electrons into independent spin-up and 
spin-down subspaces forms fours nodal cells. In the CI case we can see that the HF nodes have 
been distorted and bended to build up channels, which connect regions of the same sign with 
resulting two maximal nodal cells. 

In order to illustrate the accuracy of the constructed CI nodes we have carried out the fixed- 
node DMC simulations with both HF and CI trial wave functions. For the HF trial wave function 
we have obtained the total energy -19.8395(7) H, which recovers « 94% of correlation energy. 
For the CI trial function with more than 5000 determinants from double excitations into 45 virtual 
orbitals with ccpV6Z basis (with up to / basis functions) the total energy reaches -19.870(5) H, 
recovering thus w 98% of correlation energy. The estimated exact energy is -19.8822 H. It is 
clear that despite the extensive determinantal expansion the fixed-node error is still visible. It is 
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just another demonstration that convergence of such correlated wave functions is slow and more 
sophisticated approaches are needed to reach beyond the limits of the determinantal expansions. 

3.6 Conclusions 

We reviewed some of the recent advances in understanding the general properties of nodes 
of fermionic wave functions. One of the key findings is that the nodal topologies of generic 
fermionic ground states appear to be remarkably simple. For d > 1 the node bisects the configu- 
rations space with the resulting nodal domains being connected. Intuitively, this is not completely 
unexpected since presence of any node raises the wave function curvature, i.e., the kinetic en- 
ergy. Therefore the energy minimization leads to decreasing both the nodal "volume" and also 
the corresponding curvatures. We have also seen that by imposing additional symmetries one can 
increase the nodal domain counts. What is less obvious that even in such cases the interactions 
tend to smooth out the nodal surfaces and fuse the multiple domains into the minimal number. 
This happens for both spin-polarized and unpolarized systems as we have shown on few-particle 
examples with correlated wave functions. 

Comparison of bosonic and fermionic ground states from the viewpoint of nodal topologies 
reveals an interesting perspective. It is well-known that generic bosonic ground states are node- 
less and so that one can say that they represent "global S-waves". On the other hand, the bisection 
of the configuration space suggests that generic fermionic ground states are "global P-waves". 
This is a remarkably beautiful result of the interplay between continuous space and symmetries: 
it shows that Nature has chosen the simplest topologies to fulfill the fundamental symmetries. 

Furthermore, analysis of QMC data also reveals that much of the correlation in fermionic 
systems has actually bosonic nature, i.e., particle correlations within the nodal domains such 
two-electron cusps, multi-particle even parity exchanges, etc. This gives a rather prominent role 
to the fixed-node DMC method which recovers such bosonic correlations exactly. Comparison 
of fixed-node results with estimations of exact total energies (from other methods for small num- 
ber of electrons and from experiments for extended systems) show that this is indeed the case. 
Calculations of atoms, molecules, clusters, solids, quantum liquids, etc, shows that the amount 
of recovered correlation is similar essentially in all cases and reaches 90-95% of the total corre- 
lation energy irrespective of the type of the system. This is very high when compared with other 
approaches based on explicitly correlated wave functions. In fact, for extended systems, there 
is no other method which can achieve such accuracy. Indeed, this is a significant outcome of 
the QMC efforts over the years and it is also one of the key reasons why these approaches are 
becoming more common. The achieved level of accuracy now enables to predict many "robust" 
quantities, i.e., which are larger than 0.1-0.2 eV, such as cohesions, optical gaps, etc. 

On the other hand, the effects which are beyond the fixed-node approximation still leave 
many important and exciting problems unsolved or unreachable. Subtle effects such as magnetic 
ordering, presence of new quantum phases, happen at scales of tens of meV or smaller and, con- 
sequently, can be significantly affected by the fixed-node bias. A limited understanding of many 
of these aspects is still the main reason why the progress has been very gradual albeit steady over 
the years. Although the origin of these errors is exclusively in inaccuracies of the nodal shapes at 
present there are no clear-cut solutions which would enable to describe these in an efficient and 
general manner. Also how much and which types of nodal errors can influence interesting quan- 
tum phenomena is not yet understood. One possible path forward is to get better handle of the 
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nodal variational freedom using more effective nodal representations or new constructions which 
will address this particular challenge. It seems that this is out of reach of traditional approaches: 
common correlated wave function constructions, such as expansion in excited determinants, ap- 
pear to be very slowly converging and are therefore inefficient, limiting the applications to small 
sizes. Such constructions are essentially non-existent for extended systems such as solids or 
surfaces. Some of the recently explored ideas in this direction will be presented below. Another 
possibility to overcome this fundamental obstacle might come from better use of statistical meth- 
ods to deal with the nodal inaccuracies. In this respect, we mentioned some approaches in the 
previous section and there are ongoing efforts by us and others in exploring ideas in this research 
area as well. Finally, "brute force" approaches, such as release node methods, are not necessarily 
useless and could be useful for benchmarks in smaller and intermediate size systems. 
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4 Pfaffian Pairing Wave Functions 



4.1 Introduction 



As we mentioned, commonly used trial wave functions are based on one-particle orbitals and 
Slater-Jastrow forms. In order to overcome this one-particle based constructions, pair orbitals 
have been previously tried in various frameworks. In condensed systems one such example 
is the Bardeen-Cooper-Schrieffer (BCS) wave function, which is an antisymmetrized product 
of singlet pairs. In quantum chemistry, the pair orbital is sometimes referred to as geminal 
and the resulting wave function as the antisymmetrized geminal product (AGP). It has been 
recently used to calculate several atoms and molecules as well as superfluid ultracold Fermi 
gases [85, 86, 97]. The results show promising gains, nevertheless, for partially spin-polarized 
systems the improvements are less systematic due to the lack of pair correlations in the spin- 
polarized subspace [85, 86]. The spin-polarized (triplet) pairing wave functions naturally lead to 
Pfaffians, which have been previously applied to model systems [98-100]. 

In this section, we further develop ideas from Sec. 2.4.2, in which we have introduced the 
Pfaffian form. Sec. 4.2 presents mathematical identities and expressions associated with Pfaffi- 
ans, some of them derived for the first time. In Sec. 4.3 we establish relationships between the 
Pfaffian, BCS and HF wave functions. These forms are tested on atomic and molecular systems 
in variational and fixed-node DMC OA methods as described in Sec. 4.4.1. In Sec. 4.4.2 we 
investigate generaUzations to linear combinations of Pfaffians and compare the results with other 
methods in regard to to wave functions compactness, the wave functions. In Sec. 4.4.3 we ana- 
lyze the topological differences between HF, Pfaffian and an essentially exact wave functions for 
a given test example. 



First introduced by Cayley in 1852 [101], the Pfaffian is named after German mathematician 
Johann Friedrich Pfaff. Given a 2n x 2n skew- symmetric matrix A = [aij], the Pfaffian of A is 
defined as antisymmetrized product 



pf[A] = A[ai^2a3,4, ■ ■ ■ a2n-l,2n] 



4.2 Algebra of Pfaffians 



Here we will closely follow our exposition of Pfaffian algebra from Ref. [2]. 



4.2.1 Definitions 




(125) 



a 



where the sum runs over all possible (2n— 1)!! pair partitions a = {{ii, ji), {i2, j2), ■ ■ ■ , (imjn))} 
of {1,2,..., 2n} with ik < jk- The sign of permutation associated with the partition a is de- 
noted as sgn(Q!). The Pfaffian for a matrix of odd order equals to zero. The following example 
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gives Pfaffian of a ^4(4 x 4) skew-symmetric matrix 








ai2 


Ol3 


ai4 


pf 


-ai2 





^23 


0-24 


-ai3 


-a23 





034 




-ai4 


-a24 


-(134 






= CH2<l34 — Cfl3<l24 + <ll4(l23- 



It can be also evaluated recursively as 

2n 

[^] = J2 sgn(aij) Oi, 

2n 

= Yai,jPc{ai,j), 

3=2 



(126) 



(127) 



where aij is partition with ik,jk 7^ 1) j and Pc(«i,j) is defined as Pfaffian cofactor of aij. The 
cofactor for an element Uj^k is given by a formula, 



Pc(a„fc) = (-ir+'=+Vf[^(.7,fc;j,fc)] 



(128) 



where the matrix A{j, k;j, k) has the rank 2(n — 1) x 2(n — 1) and is obtained from A by 
eliminating j and fc rows and colunms. 



4.2.2 Calculation of a Pfaffian 

There exist several identities involving Pfaffians and determinants. For any 2n x 2n skew- 
symmetric matrix A and arbitrary matrices B(2n x 2n) and M{n x n) we have the following 
relations: 

Ai " 

^2 

M 




pf 



pf 



(-l)>f[A] 


(129a) 


dct[^] 


(129b) 


pf[Ai]pf[A2] 


(129c) 


det[-B]pf[A] 


(129d) 


(-1) '2 *det[M]. 


(129e) 



Let us comment on these identities and point out to respective proofs as appropriate: 

(129a) Each permutation contains product of n pairs resulting in an overall (—1)" factor. 

(129b) This is a well-known Cayley's relationship between the Pfaffian and the determinant of a 
skew-symmetric matrix. This is a well-known relationship which has been proved many 
times in variety of ways [102-104]. 



(129c) Use the expansion by Pfaffian cofactors. 
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(129d) By squaring (4d), using Eq. (129b), and taking the square root one finds pi[BAB^] = 
±det[i?]pf [A]. Substituting the identity matrix / for B one finds + to be the correct sign. 

(129e) Assume 



in Eq. (129d). The overall sign is given by value of pf [A]. 

The identities Usted above imply several important properties. First, Eqs. (129d) and (129e) 
show that every determinant can be written as a Pfaffian, but on the contrary, only the absolute 
value of Pfaffian can be given by determinant [Eq. (129b)]. The Pfaffian is therefore a generalized 
form of the determinant. Second, by substituting suitable matrices [105] for M in Eq. (129d) one 
can verify the following three properties of Pfaffians [106], similar to the well-known properties 
of determinants. 

(a) MultipUcation of a row and a column by a constant is equivalent to multiplication of the 
Pfaffian by the same constant. 

(b) Simultaneous interchange of two different rows and corresponding columns changes the 
Pfaffian sign. 

(c) A multiple of a row and a corresponding column added to another row and corresponding 
column leaves the Pfaffian value unchanged. 

It is also clear that any skew-symmetric matrix can be brought to the block-diagonal form by an 
orthogonal transformation. Recursive evaluation [Eq. (127)] implies that the Pfaffian of block- 
diagonal matrix is directly given by 




and 




■ Ai 
-Ai 







pf 



As 
-A2 



— A1A2 . . . A„. 



(130) 









-A. 



A. 







Therefore by employing a simple Gaussian elimination technique with row pivoting (see Ref. [107]) 
we can transform any skew-symmetric matrix into block-diagonal form and obtain its Pfaffian 
value in O(n^) time. 
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However, in QMC applications, one often needs to evaluate the wave function after a single 
electron update. Since Cayley [104] showed (for the proof see [2]) that 



det 






bl2 


^13 


■ • bin 


-ai2 





023 


■ ■ a2n 


-ai3 


-023 





■ ■ 03n 


— din 


— a2n 


— 0-371 ■ 


.. 



pf 
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-ai3 
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-023 



ai3 

023 
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(131) 






bi2 


bl3 


• bin 


-bi2 





023 


■ 02ra 


-bi3 


-023 





• • 03„ 


-bin 


— 02ti 


— OSn • 


.. 



we can relate the Pfaffian of original matrix pf[A] to the Pfaffian of a matrix with updated first 
row and colunon pf [-B] using the inverse matrix in only 0{n) operations by 



pf[B] 



det[A] E,- bijA 



pf[A] 



(132) 



The second part of Eq. (132) was obtained by taking advantage of the identity in Eq. (129b). 
Slightly more compUcated relation between pf [A] and pf [-B] can be derived if one considers 
simultaneous change of two separate rows and colunms, which represents the two electron update 
of a wave function. 



4,2,3 Gradient and Hessian of Pfaffian 

In optimization routines, it is often favorable to calculate the gradient and Hessian of wave func- 
tion with respect to its variational parameters. Given the matrix elements A dependent on a set 
of parameters {c}, one can derive the following useful relations: 



and 



1 9pf[A] ^ 1 

pf[A] da 2 ' 



A 



.idA 
dci 



(133) 



1 a2pf[A] _1 



pf[A] dcidcj 



\-idA- 




\-idA- 




tr 




dci 


9c j_ 



-tr 



^-.9A^-.dA 



dci 



dc-i 



-tr 



A- 



, d^A 
dcidc-i 



(134) 



where A ^ is again the inverse of A. 
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4.3 Pairing Wave Functions 

Let us now consider the generalization of the one-particle orbital to a two-particle (or pair) orbital 
<p{i,j), where tilde again denotes dependence on both spatial and spin variables. The simplest 
antisymmetric wave function for 2A'' electrons constructed from the pair orbital is a Pfaffian 

* = ^[^(1, 2), 0(3, 4) . . . 0(27V - 1, 2N)] = pf[0(i,.7)]. (135) 

The antisymmetry is guaranteed by the definition (125), since the signs of pair partitions alter- 
nate depending on the parity of the corresponding permutation. The important difference from 
Slater determinant is that in the simplest case only one pair orbital is necessary. (This can be 
generahzed, of course, as will be shown later.) If we further restrict our description to systems 
with coUinear spins, the pair orbital (^(r^, (7^; rj,aj) for two electrons in positions rj and and 
with spins projections cr, and aj can be expressed as 

4>{r„a,;r,,aj) = 0(z, j)(a,a,|[| U) - I it)]/V2 (136) 

+x^Hi,j)M[\n) + \m/V2 

Here j) = (/'(r^, r^ ) is even while x^^, X^^ and x"^"^ are odd functions of spatial coordinates. 
In the rest of this section we will discuss special cases of the wave function (135). 



4.3.1 Singlet Pairing Wave Function 

Let us consider the first 1, 2, iV electrons to be spin-up and the rest N + 1, 2N electrons 
to be spin-down and allow only ^(rj, r^) in ^(rj, cTj; r^, aj) to be non-zero. Using the Pfaffian 
identity [Eq. (129e)], we can write the wave function for singlet pairs, also known as the 
Bardeen-Cooper-Schrieffer (BCS) wave function (or AGP), in the following form 

= det [*''"■'-], (137) 

which is simply a determinant of the N x N matrix ^'^^ = [<p{i,j)] as was shown previ- 
ously [108, 109]. 

Let us comment on the BCS wave function which has its origin on the theory of supercon- 
ductivity [110]. The original form is actually for variable number of particles, since the theory 
is formulated in the grand canonical ensemble — the number of particles adjusts to the chemical 
potential. Hence, if the grand canonical form is projected onto the state with fixed number of 
particles, its form is then given by Eq. (137). 

It is straightforward to show that the BCS wave function contains the restricted HF wave 
function as a special case. Let us define the Slater matrix C = [fiiJ)] where {ifi} is a set of HF 
occupied orbitals. Then we can write 

= det[C]det[C] = det[CC'^] = det[*^^^], (138) 
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where 

JV 

= <pHF{i,j) = ^<Pk(i)'Pk{j)- (139) 

k=l 

On the other hand, we can think of the BCS wave function as a natural generahzation of the HF 
one. To do so we write the singlet pair orbital as 

M>N 

ct>{i,j) = Sk,iMi}Vi{j) = <P{i}S<p{j), (140) 

k,l 

where the sum runs over all M (occupied and virtual) single-particle orbitals and S is some 
symmetric matrix. Therefore, we can define one-particle orbitals, which diagonaUze this matrix 
and call them natural orbitals of a singlet pair. 

The BCS wave function is efficient for describing systems with single-band correlations such 
as Cooper pairs in conventional BCS superconductors, where pairs form from one-particle states 
close to the Fermi level. 



4.3.2 Triplet Pairing Wave Function 

Let us assume, in our system of 2N electrons, that the first Mi electrons are spin-up and re- 
maining M2 = 2N — Ml electrons are spin-down. Further, we restrict Mi and M2 to be even 
numbers. Then by allowing only x^^(^) j) ^nd x^^ihj) (136) to be non-zero, we obtain from 
expression (135) by the use of Eq. (129c) 

r^^^ 



*TP = pf 



^ 



.4.4. 



pfK^ipf[ei, (141) 



where we have introduced Mi x Mi(Af2 x M2) matrices ^'^^^^^^ = [x^'^^^^\i, j)] ■ This result 
was previously found in a weaker form in Ref. [108, 109]. 

The connection to a restricted HF wave function for the above state can be again estabUshed 
as follows. In accord with what we defined above, dct[(C)'''(^'] are spin-up(-down) Slater deter- 
minants of some HF orbitals {(pi}. Then, by taking advantage of Eq. (129e) we can write 

= det[C''']det[C^] (142) 

_ pf[CUiC'''^]pf[C^A2C^^] 
~ pi[Ai]pi[A2] ' 

given Ai and A2 are some skew- symmetric non-singular matrices. In the simplest case, when 
Ai and A2 have block-diagonal form (130) with all values A, = 1, one gets 

*HF=pfK|,V]pf[dF]- (143) 
The pair orbitals can be then expressed as 

i^HF%,J=X^}l^''\i,j) (144) 

Mi(M2)/2 

= XI i'P2k-li^)f2k{j) - 'P2k-l{j)f2kii))- 
fe=l 
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Similarly to the singlet pairing case, the triplet pairing wave function appears as a natural gener- 
alization of the HF one. We can write the triplet pair orbitals as 

M>Mi(M2) 
k,l 

= V?(j)A^^(^^)¥>(j), (145) 

where again the sum runs over all M (occupied and virtual) single-particle orbitals and A^^^"^"^^ 
are some skew- symmetric matrices. Therefore we can define one-particle orbitals which block- 
diagonalize these matrices and call them natural orbitals of a triplet spin-up-up (down-down) 
pair. 



4.3.3 Generalized Pairing Wave Function 



Let us now consider a partially spin-polarized system with unpaired electrons. In order to in- 
troduce both types of pairing we allow x^^ih.l)' X.^^{hj) in (136) to be non-zero. 
However, we omit the x^^ihj) term. Then our usual ordered choice of electrons labels with all 
spin-up electrons first and remaining electrons spin-down enables us to directly write from (135) 
the singlet-triplet-impaired (STU) orbital Pfaffian wave function [1] 



^STU = Pf 



l-(p 



-f 



.IT 







(146) 



where the bold symbols are block matrices or vectors of corresponding orbitals as defined in 
Sections 4.3.1 and 4.3.2 and T denotes transposition. For a spin-restricted STU wave fimction 
the pair and one-particle orbitals of spin-up and -down channels would be identical. 

The Pfaffian form can accommodate both singlet and triplet pairs as well as one-particle 
unpaired orbitals into a single, compact wave function. The correspondence of STU, Pfaffian 
wave function to HF wave function can be estabUshed in a similar way for the pure singlet and 
triplet pairing cases. 

It is instructive to write down some illustrative examples of the introduced form for a few- 
electron atomic states. For example, for the Li atom with ^S'(ls'^2s) we can write 



*STi7 = pf 



e^t(i,2) </.n(i,3) ^T^(i) 

^^^2,1) </.t;(2,3) ^1,(2) 

cj>n(3,l) 0t4(3,2) cpl{3) 

-V'L(l) -VL(2) -<^i«(3) 



= e^ni,2)^t(3) - ^^^(1,3)(pL(2) +0^^(2,3)^L(1)- 



(147) 



It is easy to check that the expression has the correct antisyrmnetry. Also note that on can get the 
HF wave function by substitution 



(148) 
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or, alternatively, by setting 



(149) 



For the valence part of the carbon atom (we skip the He-core), we can write the state as ^P(2s^2p^) 
with the corresponding wave function 



^STU = Pf 



eHh2) eHh^) 0^^(i,4y 

eH2,l) Ctt(2,3) </,n(2,4) 

eH3,l) eHS,2) #H3,4) 

-#^(4,1) -#^(4,2) -(l)^^{4,3) 



(150) 



Finally, for the nitrogen atom valence electrons the quartet state given by ^5(2s^2p^) and the 
wave function can be written as 
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(151) 



4.4 Applications of Pairing Wave Functions in QMC 

In the remainder of this section, we would like to illustrate the applications of Pfaffian paring 
wave functions in QMC methods. As we have mentioned before in Sec. 2.4.2, the trial wave 
function used in QMC calculations by variational and fixed-node diffusion Monte Carlo is a 
product of an antisymmetric part times the Jastrow correlation factor: 

*t(R) = *a(R) eMUcorriinj}, {ni}, {rji})], (152) 

where Ucorr depends on electron-electron, electron-ion and, possibly, on electron-electron-ion 
combinations of distances. The antisymmetric part can be represented as '^a = '^hf and '9 a = 
^STU or 'i>A = ^Bcs- Further, the pair orbitals xihj) ^ind j) are expanded in products of a 
one-particle orbital basis [85] according to Eqs. (140) and (145). This approach was particularly 
convenient due to availability of one-particle atomic and molecular orbitals from either Hartree- 
Fock or CI correlated calculations. However, expansion of this type might not be the most 
optimal for homogeneous or extended periodic systems and other forms were also successfully 
employed [97]. For the applications below, we typically used about 10 virtual orbitals. The 
natural orbitals (see e.g. Ref. [22]) produced better and more systematic results than the HF 
ones. The pair orbital expansion coefficients were then optimized in VMC by minimizations 
of energy, variance or a combination of energy and variance (for details, see Sec. 7.2). The 
optimization procedure required the calculation of gradient and the Hessian of the wave function 
according to Eqs. (133) and (134). 
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4.4.1 Single Pfaffian Calculations 

The performance of single Pfaffian pairing wave functions in the STU form [Eq. (146)] was 
tested on several first row atoms and dimers [1]. The pseudopotentials [93, 111] were used to 
eliminate the atomic core electrons, except in Be atom case. 

The most important result is that STU wave functions systematically achieve higher percent- 
age of recovered correlation energy than single determinant wave functions in the fixed-node 
DMC method (see Table 2 and Fig. 14). Another significant result is that in general the triplet 
contribution for these single Pfaffian STU wave functions are small, perhaps the only exception 
being the nitrogen atom, where we see a gain of additional 1 % in correlation energy when com- 
pared to a trial wave function without triplet pairs. We believe, this is due to the fact, that ground 
state of nitrogen atom has a quartet spin state and therefore the highest spin polarization from 
all studied cases. Overall, the single Pfaffian form is capable of capturing near-degeneracies and 
mixing of excited states for both spin-polarized and unpolarized systems. 




System 



Fig. 14. Correlation energies obtained by QMC methods with the different trial wave functions: VMC and 
fixed-node DMC with HF nodes (HF) and STU Pfaffian nodes (PF). The lower plot shows the fixed-node 
DMC correlation energy gains over HF nodes for BCS and STU Pfaffian wave functions. The statistical 
error bars are of the symbol sizes or smaller. Except for the Be atom all the calculations used the same 
pseudopotentials [93, ill]. 
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Tab. 2. Total energies for C, N and O atoms and their dimers with amounts of the correlation energy recov- 
ered in VMC and DMC methods with wave functions as discussed in the text. Unless noted otherwise, the 
numbers in parentheses are the statistical errors in the last digit from corresponding QMC calculation. Ener- 
gies are in Hartree atomic units. For C, N, O atoms wc used the correlation energies by Dolg [1 12](0.1031, 
0.1303, 0.1937 H). For the estimation of correlation energies of dimers we needed accurate HF energies at 
experimental distances [113] and the estimated exact total energies. Each exact total energy was estimated 
as a sum of total energies of constituent atoms minus experimental binding energy [113-115] adjusted for 
experimental zero-point energy [115]. 
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DMC/HF 


-5.4061(3) 


88.6(2) 


-9.7496(2) 


92.6(2) 


-15.8421(2) 


94.8(1) 


DMC/BCS 


-5.4140(2) 


96.3(2) 


-9.7536(2) 


95.7(2) 


-15.8439(4) 


95.7(2) 


DMC/STU 


-5.4139(2) 


96.2(2) 


-9.7551(2) 


96.8(1) 


-15.8433(3) 


95.4(2) 


Est./Exact 


-5.417806 


100 


-9.759215 


100 


-15.85216 


100 


Method/WF 


C2 




N2 




O2 




HF 


-10.6604 





-19.4504 





-31.3580 





VIVIC/HF 


-10.9579(4) 


72.9(1) 


-19.7958(5) 


80.0(1) 


-31.7858(6) 


79.6(1) 


VIVIC/BCS 


-11.0059(4) 


84.7(1) 


-19.8179(6) 


85.0(1) 


-31.8237(4) 


86.7(1) 


VIVIC/STU 


-11.0062(3) 


84.8(1) 


-19.821(1) 


85.8(2) 


-31.8234(4) 


86.6(1) 


DIVIC/HF 


-11.0153(4) 


87.0(1) 


-19.8521(3) 


93.0(1) 


-31.8649(5) 


94.3(1) 


DMC/BCS 


-11.0416(3) 


93.5(1) 


-19.8605(6) 


94.9(1) 


-31.8664(5) 


94.6(1) 


DMC/STU 


-11.0421(5) 


93.6(1) 


-19.8607(4) 


95.0(1) 


-31.8654(5) 


94.4(1) 


Est./Exact'^ 


-11.068(5)^ 


100.0(10) 


-19.8825(6)'' 


100.0(1) 


-31.8954(1)'' 


100.0(1) 



"There is rather large discrepancy in the experimental values of C2 binding energy (141.8(9) [113], 143(3) [115] 
and 145.2(5) kcal/mol [1 14]). For the estimation of exact energy we have taken the average value of 143(3) kcal/mol. 
''Experimental binding energies taken from ref. [113]. 
"^The error bars on estimated exact total energies are due to experiment. 



4.4.2 IMulti-Pfafflan Calculations 

In order to capture tlie correlation energy missing from the single STU Pfaffian wave fimction 
and to test the limits of the Pfaffian functional form, we have also proposed to expand the in 
the linear combination of STU Pfaffian wave functions [1,2]. 

Following the approach adopted from the CI correlated calculations, accurate can be 
expressed as a linear combination of reference state ^0 and single ^^ and double^f^j excitations: 

N M N M 

^ciSD = co*o + E E + E E (153) 

i k ij kl 

where i-lh and j-th electrons are being excited into k and I virtual orbitals. Note that the the 
number of determinants in the expansion will be of the order of x M^. Analogously, we 



66 



M. Bajdich and L. Mitas 



Tab. 3. Percentages of correlation energies recovered for C, N and O atoms by VMC and DMC methods 
with wave functions as discussed in the text. The corresponding number of Pfaffians or determinants n for 
each wave function is also shown. For details, see caption of Table 2. 



MethodAVF 


n 


C 


n 


N 


n 


O 


VMC/MPF 


3 


92.3(1) 


5 


90.6(1) 


11 


92.6(3) 


VMC/CP 


98 


89.7(4) 


85 


91.9(2) 


136 


89.7(4) 


DMC/MPF 


3 


98.9(2) 


5 


98.4(1) 


11 


97.2(1) 


DMC/CP 


98 


99.3(3) 


85 


98.9(2) 


136 


98.4(2) 



"The determinantal weights were taken directly from CI calculation without re-optimization in VMC. 
postulate the multi-Pfaffian (MPF) wave function as a linear combination of STU Pfaffians: 

N N 
i ij 

where each cpij is the generalized paring orbital [Eq. (136)] containing aU possible exci- 
tations of i and j electrons. The resulting wave function will in general consist of only N'^ 
Pfaffians. Further, if the reference state is the most dominant state (i.e., cq '> c^j), it is 
possible to show by expanding cpij into orders of cfj/co that 

*MPF = ^ciSD + O {(c'lj/cof) . (155) 

In fact, the only difference between the and \1/c/sd wave functions is in presence of the 

higher order excitations, which are approximately present in the V^mpf- 

The mapping of the MPF wave functions onto equivalent CISD wave functions was also 
verified numerically in the variational and FN-DMC methods using the above first row atoms 
and molecules [1, 2]. The results in Table 3 show that for the atomic systems the MPF wave 
functions are able to recover close to 99% of correlation energy — very similarly to CISD wave 
functions, while requiring order less terms. The results for diatomic cases (see Table 4) show 
very similar behavior. The correlation energy recovered is on the order 98% with MPF wave 
functions closely matching the CISD wave functions — despite much richer electronic structure 
than in atomic cases. The comparison with the CI results therefore demonstrates that it is possible 
to obtain similar quality wave functions with corresponding improvements of the fermion nodes 
at much smaller calculational cost. 

4.4.3 Nodal Properties 

As we have already extensively discussed in Sec. 3, the fermion node is a complicated hyper- 
surface defined by an implicit equation '^a{R) — 0. Due to its complexity, we approximate 
the exact fermion node with the node of the best available trial wave function. However, any 
deviations from the exact node lead to the fixed-node DMC errors. It is therefore quite valuable 
to compare the nodes of a different accuracy. 
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Tab. 4. Total energies for C2 and N2 dimers with amounts of correlation energy recovered in VMC and 
DMC methods with wave functions as discussed in the text. Energies are in Hartree atomic units. The 
corresponding number of Pfaffians or determinants n for each wave function is also shown. For details, see 
caption of Table 2. 



Method/WF 


n 


C2 




n 


N2 


^corr l*^] 


VMC/MPF'= 


11 


-11.0402(1) 


93.1(1) 


16 


-19.8413(6) 


90.5(1) 


VMC/CP 


404 


-11.0409(3) 


93.3(1) 


535 


-19.8487(6) 


92.2(1) 


DMC/MPF'= 


11 


-11.0574(5) 


97.3(1) 


16 


-19.8670(8) 


96.4(2) 


DMC/CP 


404 


-11.0593(6)'' 


97.8(2) 


535 


-19.8713(5) 


97.4(1) 



"The determinantal weights were re-optimized in the VMC method. 

''Recently, Umrigar et.al. [84] published very accurate DMC result for fully optimized CI wave function with up to 
500 determinants for C2 molecule. The resulting well-depth of his calculation is 6.33(1) eV, which is only 0.03 eV 
form estimated exact value of Ref. [1 16]. The well-depth resulting from our DMC/CI energy of —11.0593(6) H equals 
to 6.08(3) eV. 

'^one type of triplet-like excitation not included. 




Fig. 15. A three-dimensional cut through the fermion node hypersurface of oxygen atom obtained by 
scanning the wave function with a spin-up and -down (singlet) pair of electrons at the equal positions, 
while keeping the rest of electrons at a given VMC snapshot positions (small green spheres). Nucleus is 
depicted in the center of the cube by the blue sphere. The three colors (from left to right) show nodes of: 
Hartree-Fock (red/dark gray); Multi-Pfaffian nodes (orange/medium gray); and the nodes of the CI wave 
function (yellow/light gray). The CI nodal surface is very close to the exact one (see text). The HF node 
clearly divides the space into four nodal cells while Pfaffian and CI wave functions partitioning leads to the 
minimal number of two nodal cells. The changes in the nodal topology occur on the appreciable spatial 
scale of the order of I a.u. 

The Fig. 15 shows the example of nodal structure of oxygen atom (Is^ electrons were re- 
moved by pseudopotential). Here we compare the nodal surfaces of HF (no pairing), MPF Pfaf- 
fian (STU pairing) and a high accuracy CI wave function with more than 3000 determinants, 
which gives essentially exact fermion nodes (i.e., 99.8(3)% of correlation energy in fixed-node 
DMC). From the figure, it is clear that the changes in the nodal surfaces are significant — the most 
important one being the elimination of artificial four nodal cells resulting from the independence 
of spin-up and -down channels in HF. The Pfaffian smooths-out the crossings and fuses the com- 
partments of the same sign into the single ones. These topology changes therefore lead to the 
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minimal number of two nodal cells (for details see Sec. 3). However, the MPF Pfaffian nodes 
are noticeably different close to nucleus and in the tails and the amount of missing correlation 
energy is still non-negligible. 

4.5 Conclusions 

The proposed Pfaffians with singlet and triplet pairs together with unpaired orbitals provides a 
variationally rich and compact wave functions. They offer significant and systematic improve- 
ments over commonly used Slater determinant-based wave functions and can be calculated as 
easily as determinants using the presented formulas. We have also shown relationships between 
HF, BCS(AGP) and Pfaffian wave functions. Clearly, Pfaffian pairing wave functions are able 
to capture a large fraction of missing correlation energy with consistent treatment of both spin- 
polarized and unpolarized pairs. We have explored Pfaffian expansions which enabled us to 
capture additional correlation and map them onto the equivalent CISD wave functions. Finally, 
the gains in correlation energy for Pfaffians come from improved fermion nodes, which are very 
close to the exact ones and exhibit the correct topology with the minimal number of two nodal 
cells. 
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5 Backfiow Correlations in Slater and Pfafflan Wave Functions 
5.1 Introduction 

Another approach for improvement of the nodal accuracy of variational trial wave functions is 

to employ backflow correlations. They were first introduced by Feynman and Cohen [57] as a 
simple ansatz to describe correlations and excitations in liquid ^He. Since then, a number of 
authors [55, 58-62] showed that the backflow correlations are useful also in fermionic systems. 
Recently, the backflow transformation of electron coordinates has been applied to chemical (e.i., 
inhomogeneous) systems [63, 64, 117, 118]. This section contains detailed description of in- 
homogeneous backflow transformation for multi-determinantal and Pfafflan wave functions and 
their applications. 

Let us try to briefly elucidate where the motivation for the backflow comes from. Con- 
sider the trial function of the famiUar form which we used a number of times before 'S't(R) = 
det[...]e^='"''', and let us apply the projector exp(— t'H) as given by 

exp(-r^)*T(R) = *t(R) - tH^t{B,) + 0{t'^) (156) 
In order to analyze the flrst order term we re-arrange the terms in the Hamiltonian 

n = T + Ve^t + Vee = T+ (Kc^t + Vmf) + (Ke - Kn/), (157) 

where Vmf is a one-particle mean-field approximation for Vee which corresponds to the theory 
used for generation of orbitals. A closer look on the first order term then leads to the following 

:«det[...]e^— =e^--(T + Kxt + Kn/)det[...] + det[...](T + - Vmf)e^^°^^ 

_ Ve^-"- -Vdet [...]. (158) 

Note that the first two terms should be close to ^t(^)Eioc{'R-), where E[oc(R-) is some function 
which is not very far from a constant. The reason is that in the first term the determinant solves 
the corresponding one-particle Hamiltonian, while in the second term the correlation is optimized 
to cancel out the difference Vee ~ Vmf - The third term appears to be "spurious" since it is not 
compensated by any potential. It has a form of the dot product of two "fluxes", one resulting 
from Vext + Vmf while the other from the electron-electron interactions. These terms need to be 
"cancelled" out what requires modifying the wave function or including additional terms. Which 
types of terms would be effective? We consider two limiting cases. The first one corresponds to 

|Vdet[...]| > |Ve^— I (159) 

with that the "flux" from the determinant having much larger amplitude so that the system is 
strongly inhomogeneous — ^typical behaviour for chemical systems and solid state materials. For 
these cases the excited determinants, Pfafflans, etc, appear as the most obvious, if not the most 
efflcient, way to cancel out the spurious component. On the other hand, if the inequahty goes the 
other way 



|Vdet[...]| < |Ve^'=""-'- 



(160) 
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then the system has density close to a constant or only mildly varying so that the electron-electron 
term dominates. This is typical for homogeneous electron gas, quantum liquids such ^He and 
^He, and similar systems. Feynman and Cohen showed that in this case the backflow coordinates 
can partially compensate for the spurious term, basically by sHghtly displacing a given particle 
to reflect the influence of particles which happen to be nearby. 



5.2 Backflow Wave Function Form 

As already discussed in Sec. 2.4.2, our trial wave function has the form 

*t(R) = *a(X) X exp[C/eo^^(R)], (161) 

where X = (xi, . . . , x^v) represents some quasi-particle coordinates dependent on all N elec- 
tron positions R. Further, 'f^ is either (multi)-determinantal or Pfaffian wave function and Ucorr 
is the Jastrow correlation factor both defined in the previous sections. 
The quasi-coordinate of ith electron at position is given as 

Xi = ri+^i(R) 

= r, + ^r(R) + C(R) + C"(R), (162) 

where is the ith electron's backflow displacement divided to the contributions from one-body 
(electron-nucleus), two-body (electron-electron) and three-body (electron-electron-nucleus) terms. 
They can be further expressed as 

^r{R) = J2uirij)rij (164) 



Cr"(R) = ^^[wi{rij,rii,rji)rij + W2{rij,rii,rji)rii], (165) 

where r^j = rj — and rn = rj — r/. The x. u and wi with 11)2 terms are similar to one, two 
and three-body Jastrow terms and are further expanded as 

X{r) = '^Ckak{r), (166) 

k 

u{r) =Y,dkbk{r), (167) 

k 

'Wi,2{rij,rii,rji) = ^ gkimak{rii)ai{rji)bm{rij). (168) 

klm 

The one dimensional basis functions {a} and {b} are chosen as Gaussians with the center in 
origin or as polynomial Fade functions (Sec. 2.4.3) to preserve the electron-electron [Eqs. (49) 
and (50)] and electron-nucleus [Eq. (48)] cusp conditions. The set of variational parameters 
{c}, {d} and {g} is minimized with respect to mixture of energy and variance (for details, see 
Sec. 7.2). In addition, all electron-electron coefficients {{dk} and {gkim} with fixed k and are 
allowed to be different for spin-like and for spin-unlike electron pairs. 
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5.3 Applications of Backflow Correlated Wave Functions 

In the remainder of this section we present VMC and DMC results obtained with above imple- 
mentation of backflow correlations for determinant and Pfaffian wave functions. 

5.3.1 Homogeneous electron gas 

We benchmark our implementation of the backflow correlations on the homogeneous electron 
gas (HEG) system. For comparison purposes, we choose to study the HEG system of 54 un- 
polarized electrons in the simple cubic simulation cell with periodic boundary conditions (the 
system was studied before several times [64, 1 19, 120]). Due to the homogeneity of the problem, 
the backflow displacement in Eq. (164) has only one non-zero component: ^1'^. We conveniently 
choose u{r) to be expanded in the basis of several polynomial Pade functions to preserve cusp 
conditions with cutoff equal to half of the simulation cell. Finally, our Jastrow correlation factor 
contains only electron-electron terms. 

In Table 5 we show our results for the foUowing three electron densities of = 1, 5 and 
20. The First important result is that at the normal density (r^ — 1), the size of correlation (e.i., 
Edmc — Ehf) is relatively smaU, but at lower densities (larger rg), the correlation energy is 
larger fraction of total energy and becomes more important. Second, we observe that the HF 
and Slater-Jastrow (SJ) fixed-node DMC energies are in very good agreement with previous 
results [64, 119, 120]. However, due to the omission of higher order correlations from Jastrow 
factor and also from backflow displacement, we obtain the expected higher VMC energies and 
variances for SJ and backflow displaced SJ (SJBF) trial wave functions. Nevertheless, our fixed- 
node DMC energies for SJBF trial wave functions closely match the results of Kwon et a/. [1 19], 
and only sUghtly deviate at = 20 from results of Rios et a/. [64]. 

5.3.2 Carbon atom and dimer 

The backflow correlations in single-determinant Slater-Jastrow (SJBF) [63, 64, 117] as well as 
in multi-determinant Slater-Jastrow (CIBF) [118] trial wave functions were recently applied to 
chemical systems. We extend these studies by including the backflow into the Pfaffian-Jastrow 
(PFBF) pairing wave functions. Below is a brief discussion of our implementation and results 
for carbon atom and dimer systems. 

We employ the inhomogeneous backflow given by Eq. (163), with the functions u and wi 2 
are allowed to be spin-dependent. The x and u functions are expanded in the 1 1 Gaussian basis 
functions, while the three-body functions wi 2 were limited to a product of 4 x 4 x 4 Gaussians. 

The numerical results are summarized in Tables 6 and 7. The backflow correlations are able 
to capture additional few percent of the correlation energy for both Slater-Jastrow and Pfaffian- 
Jastrow wave functions. Another important feature of backflow is 20-30% decrease in variances 
of local energy with respect to the wave functions without backflow correlations. We find that 
for the fully optimized backflow, the spin-unlike electron-electron functions are almost order 
of magnitude larger than spin-like ones as well as electron-nucleus functions. The gains are 
systematic with increasing number of parameters, however we do not find the three-body terms 
as important as reported in previous study [64]. This difference can be attributed to two main 
reasons — we use a different basis to expand the three-body fimctions wi^2 and we also eliminate 
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Tab. 5. VMC and fixed-node DMC energies per electron and variances of local energies for various trial 
wave functions (S, Slater; SJ, Slater-Jastrow; SJBF, backflow correlated SJ) for 3D unpolarized HEG of 54 
electrons. 





Method 


WF 


£;/Af[a.u./electron] 


o-^[a.u.^] 


1.0 


HF 


S 


0.56925(2) 


19.3(1) 




VMC 


SJ 


0.53360(4) 


1.26(4) 






SJBF 


0.53139(4) 


0.81(4) 




DMC 


SJ 


0.53087(4) 


- 






SJBF 


0.52990(4) 


- 


5.0 


HF 


S 


-0.056297(7) 


0.776(4) 




VMC 


SJ 


-0.075941(6) 


0.0636(1) 






SJBF 


-0.078087(4) 


0.0236(1) 




DMC 


SJ 


-0.07862(1) 








SJBF 


-0.07886(1) 




20.0 


HF 


S 


-0.022051(2) 


0.0482(1) 




VMC 


SJ 


-0.031692(2) 


0.000711(4) 






SJBF 


-0.031862(1) 


0.000437(1) 




DMC 


SJ 


-0.031948(2) 








SJBF 


-0.032007(2) 





atomic cores by pseudopotentials. It is plausible that for systems with core electrons the three- 
body correlations are more important due to the strong variations of orbitals close to the nucleus. 

Finally, let us discuss the difference between the two systems with respect to missing cor- 
relation energy. For the C atom, we have shown previously [1] that less than 100 determinants 
gives more than 99% of correlation energy {Ecorr)- The C dimer's fixed node errors are more 
pronounced, since the 148 determinants with re-optimized weights give only 97.5(1)% in a close 
agreement with recent calculations by Umrigar et flZ.[84]. Employing backflow correlations for 
our 148 determinant Cl-Jastrow wave function gives no apparent gain in E^orr except for de- 
crease in the variance of local energy. The improvement for the Pfaffian-Jastrow wave function 
is also very modest (less than 1%). Our results suggest that to reach beyond 99% of correlation 
one still needs complicated multi-reference wave functions, even after including quite general 
forms of the backflow correlations. 



5.4 Conclusions 

We have benchmarked our backflow correlated wave functions on the HEG system and achieved 
high level of agreement with previous results. Further, we have applied the Pfaffian pairing wave 
functions to chemical systems. The results for to testing cases of carbon pseudo atom and its 
dimer show promising gains in correlations energies, decreases in variances and improvements 
in the nodal structure. 
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Tab. 6. VMC and DMC energies and variances of local energy for Slater-Jastrow (SJ), Pfaffian-Jastrow 

(PF) and Cl-Jastrow (CI) trial wave functions with backflow (BF) correlations for C atom. Notation for 
backflow parameters: 2B, electron-nucleus and electron-electron terms; 3B, all electron-electron-nucleus 
terms; 23B for all terms together. 



Method 


WF 












E [a.u.] 


0-^ [a.u.^] 


'^corr [^^l 


HF 


S 


- 


- 


- 


- 


- 


-5.31471 


- 


0.0 


VMC 


SJ 


- 


- 


- 


- 


- 


-5.3990(1) 


0.0677 


81.8(1) 




SJBF2B 


11 


22 


- 


- 


33 


-5.4011(2) 


0.0544 


83.8(2) 




SJBF3B 


- 


- 


128 


128 


256 


-5.4023(3) 


0.0504 


85.0(3) 




SJBF23B 


4 


8 


128 


128 


268 


-5.4020(2) 


0.0498 


84.7(2) 




PF 












-5.4083(2) 


0.0626 


90.8(2) 




PFBF2B 


11 


22 






33 


-5.4097(1) 


0.0427 


92.1(1) 




PFBF23B 


4 


8 


128 


128 


268 


-5.4107(1) 


0.0411 


93.1(1) 




CP 












-5.4127(1) 


0.0447 


95.0(1) 




CIBF2B 


11 


22 






33 


-5.4131(3) 


0.0427 


95.4(3) 




C1BF23B 


4 


8 


128 


128 


268 


-5.4140(1) 


0.0342 


96.3(1) 


DMC 


SJ 












-5.4065(3) 




89.0(3) 




SJBF2B 


11 


22 






33 


-5.4090(3) 




91.5(3) 




SIBF3B 






128 


128 


256 


-5.4085(3) 




91.0(3) 




SJBF23B 


4 


8 


128 


128 


268 


-5.4094(3) 




91.8(3) 




PF 












-5.4137(3) 




96.0(3) 




PFBF2B 


11 


22 






33 


-5.4145(3) 




96.8(3) 




PFBF23B 


4 


8 


128 


128 


268 


-5.4152(3) 




97.5(3) 




CI 












-5.4178(1) 




100.0(1) 




CIBF2B 


11 


22 






33 


-5.4177(3) 




99.9(3) 




CIBF23B 


4 


8 


128 


128 


268 


-5.4174(2) 




99.6(2) 


Est. 


Exact 












-5.417806 




100.0 



'Wave function consists of 100 determinants re-optimized in VMC. 
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Tab. 7. Slater-Jastrow (SJ), Pfaffian-Jastrow (PF) and Cl-Jastrow (CI) wave functions with backflow (BF) 
correlations for C dimer. Notation is the same as in Table 6. 



Method 


WF 












E [a.u.] 


(7^ [a.u.^] 


-^corr [^] 


HF 


S 


- 


- 


- 


- 


- 


-10.6604 


- 


0.0 


VMC 


sr 


- 


- 


- 


- 


- 


-10.9936(4) 


0.179 


81.7(1) 




SJBF2B 


11 


22 


- 


- 


33 


-11.0012(3) 


0.144 


83.5(1) 




SJBF23B 


4 


8 


128 


128 


268 


-11.0014(2) 


0.141 


83.6(1) 
















-11.0171(2) 


0.160 


87.4(1) 




PFBF2B 


11 


22 






33 


-11.0223(3) 


0.123 


88.7(1) 




PFBF23B 


4 


8 


128 


128 


268 


-11.0223(2) 


0.128 


88.7(1) 




CP 












-11.0420(4) 


0.112 


93.6(1) 




CIBF2B 


11 


22 






33 


-11.0440(3) 


0.100 


94.0(1) 




CIBF23B 


4 


8 


128 


128 


268 


-11.0438(3) 


0.123 


94.0(1) 


DMC 


SJ 












-11.0227(2) 




88.8(1) 




SJBF2B 


11 


22 






33 


-11.0269(4) 




89.9(1) 




SJBF23B 


4 


8 


128 


128 


268 


-11.0280(3) 




90.1(1) 




PF 












-11.0419(9) 




93.5(2) 




PFBF2B 


11 


22 






33 


-11.0443(6) 




94.1(2) 




PFBF23B 


4 


8 


128 


128 


268 


-11.0447(3) 




94.2(1) 




CI 












-11.0579(5) 




97.5(1) 




CIBF2B 


11 


22 






33 


-11.0580(4) 




97.5(1) 




CIBF23B 


4 


8 


128 


128 


268 


-11.0585(5) 




97.7(1) 


Est. 


Exact 












-11.068(5) 




100.0 



^Slater determinant contains PBE DFT orbitals. 

''Same PBE DFT orbitals are used also in PF wave function. 

"^Uses natural orbitals with weights of the 148 determinants re-optimized in VMC. 
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6 Example of Large Scale Application: FeO Solid 

As an example of calculations, which are feasible at present using available both software and 
hardware resources, we briefly report our recent study of the FeO solid. 

FeO solid is a very interesting system which has been calculated a number of times before 
by several approaches. Nevertheless, it is difficult to claim that the problems and research ques- 
tions have been all "solved" and, in fact, our understanding of this and many other transition 
metal compounds is still rather limited. FeO belongs to the group of other paradigmatic transi- 
tion metal (TM) oxides such as MnO, CoO and NiO. All of these have a rather simple rocksalt 
(NaCl type) structure denoted as B 1 , with very small deformations at lower temperatures result- 
ing from effects such as magnetostriction. Since the transition elements in these systems have 
open d-shells, such solids should be nominally metals. However, for a long time it has been 
known and experiments have shown that these systems are large gap insulators. In addition, 
they exhibit antiferromagnetic ordering with the Neel temperatures of the order of a few hun- 
dred Kelvins. Interestingly, the gap remains present even at high temperatures when there is no 
long-range magnetic order. How is it possible? Using qualitative classification introduced by 
Zaanen and coworkers[121], there are basically two relevant and somewhat competing mecha- 
nisms: Mott-Hubbard and charge transfer. In the Mott-Hubbard picture, the gap opens because of 
large Coulomb repulsion associated with the double occupancy of the strongly localized d-states. 
The spin minority bands are pushed up in the energy, leading to gap opening. In the charge trans- 
fer mechanism, 4s electrons of the transition metal atom fill the unoccupied p-states of oxygen. 
The electronic structure has an almost ionic TM+^O^^ character, with resulting gap opening as 
well. In the real materials both of these mechanisms are present and therefore the systems exhibit 
insulating behavior. The antiferromagnetism happens on the top of this and is caused by weak 
super-exchange interactions between the moments of neighbouring transition metal atoms medi- 
ated by bridging oxygens. We will not go into further details, suffice it to say that the electron 
correlations are very large and crucially affect many properties of these materials. 

If one applies standard DFT approaches to FeO, the results are highly unsatisfactory. First, 
the atomic structure which comes out is not correct. Instead of the rocksalt antiferromagnet 
another structure, so-called iB8, appears to have the lowest energy at equilibrium conditions. 
Interestingly, iB8 is actually a high pressure phase of FeO. This discrepancy is surprising since 
DFT methods typically provide correct equilibrium structures and very reasonable geometric pa- 
rameters such as lattice constants and others. Another problem appears in the electronic structure 
since for the correct geometry DFT predicts a metallic state. More sophisticated methods beyond 
DFT have been applied to this system in order to reconcile some of the results with experiments, 
nevertheless, a number of questions remain unanswered. For example, at high pressures, FeO 
undergoes a structural transition into the iB8 phase, however, the value of the transition pressure 
which agrees with experiment is difficult to obtain using the mainstream approaches. Similar 
transition appears in MnO, where recent bechmarking of several DFT approaches provided tran- 
sition pressure estimates between 65 and 220 GPa, i.e., more than 300% spread in the predictions 
[122]. Large discrepancies with expriment exist also for the CoO crystal and more complicated 
transition metal compounds. 

We have carried out QMC calculations of FeO using supercells with periodic boundary con- 
ditions to model the infinite solid [123J. Several supercell sizes were calculated and k-point 
sampUng of the BriUouin zone was carried out by the so-called twist averaging with the purpose 
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Tab. 8. Comparision of the calculated structural properies of FeO solid in DFT and in the fixed-node DMC 
with experimental data. The energy difference Eiss-Esi is evaluated at the experimental lattice constant 
4.334 A. 



Method/quantity 



DFT/PBE FN-DMC 



Exper. 



EiBs-Esi [eV] 
cohesion energy [eV] 
lattice constant | Aj 
bulk modulus [GPa] 
band gap [eV] 



-0.2 [126] 

~ 11 
4.28 [124] 
180 [124] 

~0 



0.5(1) 
9.66(4) 
4.324(6) 
170(10) 

2.8(3) 



> 
9.7 [127] 
4.334[128J 
- 180 1125] 
2.4 [129] 



of eliminating finite size effects. The core electrons were replaced by pseudopotentials for both 
Fe (Ne-core) and O (He-core). The largest simulated supercells had more than 300 valence elec- 
trons and the total energies were sizeable due to the presence of "semicore" 3s and 3p states of 
Fe in the valence space, what have made the calculations rather demanding. The wave ftmction 
had the Slater-Jastrow form and the orbitals were obtained from unrestricted (spin-polarized) 
calculations within DFT with hybrid functionals and HF. 

In Tab. 8 are shown the QMC calculated equilibrium parameters. Note that QMC identifies 
the correct equilibrium structure and provides very accurate value of the cohesive energy. Cohe- 
sive energies are very difficult to calculate since the DFT methods show typical bias of 15-30%. 
At present, there is basically no other method besides QMC which can get this level of accuracy. 
Note also good agreement with experiments for the other quantities including the band gap. In 
QMC the band gap is calculated as a difference of two total energies — ground state and excited 
state, where the excited state is formed by promoting an electron from the top valence band into 
the conduction band. 

In addition, the equations of state have been calculated for both the equilibrium structure and 
also for the high pressure phase, see Fig. 16. The estimated transition pressure is 65(5) GPa, at 
the lower end of the experimental range 70-100 GPa [124, 125]. 

Finally, considering that only a simple trial wave function was employed, the results are 
remarkable and very encouraging. Note that the calculations do not have any free non- variational 
parameters. It is simply the best possible solution within the trial function nodes and the given 
Hamiltonian. 
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14 16 18 20 22 24 
V (A^/FeO) 

Fig. 16. FN-DMC Energy as a function of volume for FeO for Bl (red squares) and iB8 (blue cir 
phases. Lines are fits with Mumagham equation of state. 
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T 

Least abstraction 



Fig. 17. Calculation structure for the VMC method on a molecule using a Slater- Jastrow wave function. 



7 QWalk: A Quantum Monte Carlo Program for Electronic Structure 

The solution of the electronic structure of tnany electron systems in quantum Monte Carlo re- 
quires the use of computers and appropriate software tools. For the practitioners of the QMC 
methods, the computational aspects such as the use of efficient algorithms, flexible code organi- 
zation and parallel efficiency are important, since they influence which types of calculations and 
which types of physical effects are feasible to study. This section is therefore aimed to introduce 
a reader to one of the existing QMC packages, Open Source QWalk code available under the 
GPL hcence [53] (www.qwalk.org). 

This section is organized as follows. In Sec. 7.1 we overview the program's organization and 
structure. The description of the implementation of wave function optimization is in Sec. 7.2. 



7.1 Organization and Implementation 

QwaUc ("Quantum Walk") is a computational package for QMC calculations of electronic struc- 
ture systems such as molecules and solids. However, others systems can be calculated such as ho- 
mogeneous electron gas, BCS-BEC condensates, ID rigns with persistent currents, etc. The code 
is written in a combination of object-oriented and procedural techniques. The object-oriented ap- 
proach is coarse-grained, creating independent sections of code that are written efficiently in a 
procedural fashion. It is extremely modular; almost every piece can be removed and replaced 
with another. For example, a contributor of a new module only has to change one line in the 
main code to allow its use. This allows for flexibihty while keeping the code base relatively 
simple and separable. The modular structure also allows for partial rewrites of the code without 
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Tab. 9. The central objects of the code and their physical correspondents 



Module name 



Mathematical object 



System 
Sample point 
Wave function type 
Wave function 
Dynamics generator 



parameters and form of the Hamiltonian 

R, the integration variables 
Wave function ansatz 

*t(R), V*t(R), V2*t(R) 
Metropolis trial move 
(Green's function) 



worrying about other parts. In fact, each major module has been rewritten several times in this 
manner as we add new features and re-optimize the code flexibility and performance. 

The modules form a tree of successive abstractions (Fig. 17). At the top of the tree is the QMC 
method, VMC in this case. It works only in terms of the objects directly below it, which are the 
concepts of System, Wave function data, etc. (see Table 9). These in turn may have further 
abstractions below them. The highest wave function object is of type "Multiply", which uses 
two wave function types to create a combined wave function. In this case, it multipUes a Slater 
determinant with a Jastrow correlation factor to form a Slater- Jastrow function. Since various 
wave functions can be plugged-in, the Slater determinant can be replaced with any antisymmetric 
function. The same applies to the Jastrow factor. The type is Usted along with the specific instant 
of that type in parenthesis. At each level, the part in parenthesis could be replaced with another 
module of the same type. The example of the flowchart of the whole program is given in Fig. 19. 



7.2 Optimization Methods in QWalls 

Historically, the first implementation of minimization method in QWalk code was the OPTIMIZE 
method, based on the quasi-Newton minimizer. It has proved effective for variance minimization 
of linear and nonlinear Jastrow parameters, however, the energy minimizations require very large 
configuration samples. 

This deficiency was partially fixed by the OPT1M1ZE2 method, which uses modified Hes- 
sians and gradients of variance and energy of Ref . [74] together with the Levenberg-Marquardt 
minimization algorithm. The analytical gradients and Hessians with respect to selected vari- 
ational parameters were implemented (determinantal and orbital coefficients, but also Pfaffian 
and pair orbital coefficients). OPTIMIZE2 has proved to be very effective method and has been 
used for most of recent calculations. 

In an attempt to remove the dependence of OPT1M1ZE2 method on the fixed MC sample, the 
NEWTON OPT method was implemented as well. Its principal advantage is that the energy and 
variance are obtained from a small MC run performed after each optimization step. 

Besides the variance and energy minimizations, it is possible to minimize several other ob- 
jective or cost functions. Their incomplete list can be found in Table 10. As was recently pointed 
out by Umrigar and Filippi [74], the wave functions optimized by minimization of a mix of en- 
ergy and variance produce almost as low energy as energy optimized wave functions but with 
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Tab. 10. List of some useful cost functions for wave function optimizations with indicated implementations 
in QWalk [53]. 

Cost Function Minimized quantity QWalk implementation 

OPTIMIZE OPT1MIZE2 NEWTON_OPT 
Variance {{El - Ef) ^ 7 7 

Energy _ _ V V \/ 

Mixed xE + {I - x){{El - Ef) V V \/ 

Absolute value (|£'l - £^1) ^ 

Lorentz (ln(l + {El - Ef/2)) \j 

E 

J 'I'T*(-r->-oo) 

/-^^ 



Ratio 
Overlap 



lower variances. 

We illustrate this for the ground state wave function of the N2 molecule (see Fig 18). The em- 
ployed NEWTON-OPT method minimized 23 Jastrow parameters (the curvatures of correlation 
basis functions and all linear coefficients) of single determinant Slater- Jastrow wave ftinction 
on set of 56000 walkers. The minimized energy from mixed objective function is almost as 
good as from the energy optimization, while the corresponding dispersion a falls in between a 
from energy minimization and variance minimization. The wave functions obtained from mixed 
objective functions therefore appear as the the most efficient compromise for further DMC cal- 
culations. 

7.3 Conclusion 

QWalk represents a state of the art, usable, and extensible program for performing quantum 
Monte Carlo calculations on several types of quantum systems. It is able to handle medium to 
large systems of electrons; the maximum size is mostly limited by the available computer time. It 
works in parallel very efficiently, so it can take advantage of large clusters, multi-core computers, 
etc. Since QWalk is available without charge and under the GNU Public license, it is hoped that it 
will help bring both development and use of quantum Monte Carlo methods to a wide audience. 
Due to its modular form it is straightforward to expand the QWalk's applicability to quantum 
systems beyond the electron-ion Hamiltonians in continuous space. With some knowledge of 
C++ it is easy to modify the system module to incorporate other types of interactions and to 
expand the one-particle and pair orbitals using the coded basis functions. 
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10 20 30 40 

Iteration number 




10 20 30 40 

Iteration number 



Fig. 18. Energy E and dispersion of the local energy a of the N2 molecule versus the minimization step 
number. Notation: minimization of variance (green triangles), energy (blue circles) and 95% mixture of 
energy and 5% of variance (red squares). Upper figure: Energy versus the iteration number. The error bars 
are of the symbol size or smaller. Inset: the later iterations on expanded scale. Note the variance minimized 
energy being higher by almost 10 mH. Lower figure: Same as the upper figure but for the dispersion of 
local energy a rather than energy. Inset shows that the mixture minimized a falls in between a from energy 
minimization and variance minimization. 
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8 Summary 

We have presented an overview of QMC methods and some of the recent developments in the 
analysis of the fermion nodes, constructions of pairing wave functions and explorations of back- 
flow correlations. Most of the apphcations were using small systems both because of educational 
purposes and in some cases because these were the first benchmarks of the new developments. 
QMC methodology has proved to be a very powerful technique for studies of quantum systems 
and also real materials as illustrated on the FeO solid calculations. In essence, QMC has a num- 
ber of advantages when compared with other approaches: 

- direct and explicit many-body wave function framework for solving the stationary Schrodinger 
equation; 

- favorable scaUng with the systems size; 

- wide range of applicabiUty; 

- the fixed-node approximation which enables to obtain 90-95% of the correlation effects; 

- scalability on parallel machines; 

- new insights into the many-body phenomena. 

The remaining fixed-node errors account for small, but still important fraction of correla- 
tions. As we have shown, further improvement caUs for better understanding of the nodes and 
their efficient descriptions. It is encouraging that studies of the nodal properties brought new 
understanding such as the two nodal cell property for generic fermionic states and revealed new 
research directions which are yet to be explored. 

We have shown that the pairing orbitals and Pfaffian wave functions show significant im- 
provements in providing more systematic results and higher accuracy. The key virtue of the 
pairing functions is that they exhibit the correct nodal topologies and as such are capable to de- 
scribe quantum condensates and pairing effects. As we explained, the backflow has perhaps more 
Umited scope of usefulness and appears to be effective especially for homogeneous or marginally 
inhomogeneous systems. Many generalizations of QMC exist. For example, for systems with 
magnetic fields and stationary currents the fixed-node methods can be generalized to the fixed- 
phase QMC [130, 131]. For finite temperature equilibrium properties Path Integral Monte Carlo 
has been used very successfuly especially for bosonic quantum hquids. 

The overall impact of the electronic structure QMC methods has been so far mainly in provid- 
ing high accuracy and benchmark results which were used by other methods as a reference. We 
believe that the impact of QMC methods is on the rise and their use will gradually become more 
routine. Already at present, there are cases for which developing elaborate mean-fields take more 
human time than direct solutions by many-body stochastic approaches. However, as we stated in 
the introduction, much needs to be done to make the methods more insightful, efficient and easy 
to use. 

Acknowledgements. Support by NSF EAR-05301110 and DMR-0804549 grants and by 
DOE DE-FG05-08OR23336 Endstation grant is gratefully acknowledged. We acknowledge also 
allocations at ORNL through INCITE and CNMS initiatives as well as allocations at NSF NCSA 
and TACC centers. We would also Uke to thank L.K. Wagner and J. Kolorenc for discussions. 



84 



M. Bajdich and L. Mitas 



References 

[1] M. Bajdich, L. Mitas, G. Drobny, L. K. Wagner, and K. E. Schmidt, Phys. Rev. Lett. 96, 

130201 (2006). 

[2J M. Bajdich, L. Mitas, L. K. Wagner, and K. E. Schmidt, Phys. Rev. B 77, 115112 (2008). 

L3] J. Grossman, J Chem. Phys. 117, 1434 (2002). 

[4] S. Manten and A. Liichow, J. Chem. Phys. 115, 5362 (2001). 

[5] W. M. C. Foulkes, L. Mitas, R. J. Needs, and G. Rajagopal, Rev Mod. Phys. 73, 33 (2001). 

[6] A. J. Wilhamson, R. Q. Hood, and J. C. Grossman, Phys. Rev. Lett. 87, 246406 (2001). 

[7] A. J. Wilhamson, G. Rajagopal, R. J. Needs, L. M. Eraser, W. M. C. Eoulkes, Y. Wang, 
and M.-Y. Chou, Phys. Rev. B 55, R4851 (1997). 

[8] R. M. Martin, Electronic Structure, Basic theory and practical methods (Cambridge Uni- 
versity Press, Cambridge, UK, 2004). 

[9] W. Heitler and R London, 44, 455 (1927). 
[101 D. R. Hartree, Proc. Cambridge Philos. Soc. 24, 89 (1928). 
[11 J E. A. Hylleraas, Z. Phys. 65, 209 (1930). 
[12] E. A. Hylleraas, Z. Phys. 54 (1929). 
[13] V. Pock, Z. Phys 61 (1930). 
[14] W. PauH, Z. Phys. 41, 81 (1927). 
[15] J. C. Slater, Phys. Rev. 35, 210 (1930). 
[16] E Bloch, Z. Phys. 52, 555 (1929). 
[17] E. Wigner and R Seitz, Phys. Rev. 43, 804 (1933). 
[18] E. Wigner and R Seitz, Phys. Rev. 46, 509 (1934). 
[19] P Hohenberg and W. Kohn, Phys. Rev. 136, B864 (1964). 
[20] W. Kohn and L. J. Sham, Phys. Rev. 140, Al 133 (1965). 
[21] J. Cizek, Adv. Chem. Phys. 14, 35 (1969). 

[22] A. Szabo and N. S. Ostlund, Modern Quantum Chemistry: Introduction to Advanced Elec- 
tronic Structure Theory (Dover, 1996). 

[23] D. M. Ceperley and B. J. Alder, Phys. Rev. Lett. 45, 566 (1980). 

[24] A. D. Becke, J. Chem. Phys. 98, 5648 (1993). 

[25] A. D. Becke, Phys. Rev. A 38, 3098 (1988). 

[26] C. Lee, W. Yang, and R. G. Parr, Phys. Rev B 37, 785 (1988). 

[27] J. P Perdew, K. Burke, and M. Ernzerhof, Phys. Rev. Lett. 77, 3865 (1996). 

[28] T. Ando, Phys. Rev. B 13, 3468 (1976). 

[29] A. Zangwill and P Soven, Phys. Rev. A 21, 1561 (1980). 

[30] O. Gunnarsson, M. Jonson, and B. I. Lundqvist, Phys. Rev. B 20, 3136 (1979). 

[31] J. R Perdew and A. Zunger, Phys. Rev. B 23, 5048 (1981). 

[32] V. L Anisimov, J. Zaanen, and O. K. Andersen, Phys. Rev. B 44, 943 (1991). 

[33] N. Metropolis, A. W. Rosenbluth, M. N. Rosenbluth, A. H. Teller, and E. Teller, J. Chem. 
Phys. 21, 1087 (1953). 

[34] M. H. Kalos and P A. Whitlock, Monte Carlo Methods (Wiley, New York, 1986). 

[35] C. Eihppi and C. J. Umrigar, Phys Rev B 61, R16291 (2000). 



Quantum Monte Carlo 



85 



[36] P. J. Reynolds, D. M. Ceperley, B. J. Alder, and W. A. Lester Jr., J. Chem. Phys. 77, 5593 
(1982). 

[37] R. C. Grimm and R. G. Storer, J. Comp. Phys. 7, 134 (1971). 

[38] D. M. Ceperley and M. H. Kalos, in Monte Carlo Methods in Statistical Physics, edited 

by K. Binger (Springer, Berlin, 1979), pp. 145-194. 
[39] C. Umrigar, M. Nightingale, and K. Runge, J. Chem. Phys. 99, 2865 (1993). 
[40] J. B. Anderson, J. Chem. Phys. 63, 1499 (1975). 
[41] J. B. Anderson, J. Chem. Phys. 65, 4121 (1976). 

[42] J. W. Moskowitz, K. E. Schmidt, M. A. Lee, and M. H. Kalos, J. Chem. Phys. 77, 349 
(1982). 

[43] D. M. Ceperiey and B. J. Alder, J. Chem. Phys. 81, 5833 (1984). 

[44] D. M. Arnow, M. H. Kalos, M. A. Lee, and K. E. Schmidt, J. Chem. Phys. 77, 5562 (1982). 

[45] S. Zhang and M. H. Kalos, Phys. Rev. Lett. 67, 3074 (1991). 

[46] R. Assaraf, M. Caffarel, and A. Khelif, J. Phys. A-Math. Theor 40, 1 181 (2007). 

[47] J. B. Anderson, C. A. Traynor, and B. M. Boghosian, J. Chem. Phys. 95, 7418 (1991). 

[48] D. L. Diedrich and J. B. Anderson, Science 258, 786 (1992). 

[49] R. Bianchi, D. Bressanini, P Cremaschi, and G. Morosi, J. Chem. Phys. 98, 7204 (1993). 

[50] M. Troyer and U.-J. Wiese, Phys. Rev. Lett. 94, 170201 (2005). 

[51] T. Kato, Commun. Pure Appl. Math. 10, 151 (1957). 

[52] R. T. Pack and W. B. Brown, J. Chem. Phys. 45, 556 (1966). 

1 53 1 L. K. Wagner, M. Bajdich, and L. Mitas, J. Comp. Phys. 228, 3390 (2009). 

[54] R. Jastrow, Phys. Rev. 98, 1479 (1955). 

[55] J. W. Moskowitz and K. E. Schmidt, J. Chem. Phys. 97, 3382 (1992). 

[56] C. J. Umrigar, K. G. Wilson, and J. W. Wilkins, Phys. Rev. Lett. 60, 1719 (1988). 

[57] R. P Feynman and M. Cohen, Phys. Rev 102, 1189 (1956). 

[58] K. E. Schmidt, M. A. Lee, M. H. Kalos, and G. V. Chester, Phys. Rev. Lett. 47, 807 (1981). 

[59] R. M. Panoff and L Carlson, Phys. Rev. Lett. 62, 1130 (1989). 

[60] Y. Kwon, D. M. Ceperiey, and R. M. Martin, Phys. Rev. B 48, 12037 (1993). 

[61] Y. Kwon, D. M. Ceperiey, and R. M. Martin, Phys. Rev. B 50, 1684 (1994). 

[62] Y. Kwon, D. M. Ceperiey, and R. M. Martin, Phys. Rev. B 53, 7376 (1996). 

[63] N. D. Drummond, P. L. Rios, A. Ma, J. R. Trail, G. G. Spink, M. D. Towler, and R. J. 

Needs, J. Chem. Phys. 124, 224104 (2006). 
[64] P L. Rios, A. Ma, N. D. Drummond, M. D. Towler, and R. J. Needs, Phys. Rev. E 74, 

066701 (2006). 

[65] D. M. Ceperley, Proceedings of the Metropolis Symposium on The Frontiers of Quantum 

Monte Cario, J. of Stat. Phys. 43 (1986). 
[66] B. L. Hammond, P J. Reynolds, and J. William A. Lester, J. Chem. Phys. 87, 1 130 (1987). 
[67] L. Mitas, E. L. Shirley, and D. M. Ceperley, J. Chem. Phys. 95, 3467 (1991). 
[68] L. Mitas, Phys. Rev A 49, 4411 (1994). 
[69] H.-J. Flad and M. Dolg, J. Chem. Phys. 107, 7951 (1997). 

[70] H. J. M. van Bemmel, D. F. B. ten Haaf, W. van Saarloos, J. M. J. van Leeuwen, and 
G. An, Phys. Rev. Lett. 72, 2442 (1994). 



86 



M. Bajdich and L. Mitas 



[71] D. F. B. ten Haaf, H. J. M. van Bemmel, J. M. J. van Leeuwen, W. van Saarloos, and D. M. 

Ceperley, Phys. Rev. B 51, 13039 (1995). 
[72] M. Casula, C. Filippi, and S. Sorella, Phys. Rev. Lett. 95, 100201 (2005). 

[73] R. Fletcher, Report R7125 AERE, Harwell, ENGLAND (1972), harwell routine VAIOA. 

[74] C. J. Umrigar and C. Filippi, Phys Rev Lett 94, 150201 (2005). 

[75] X. Lin, H. Zhang, and A. M. Rappe, J. Chem. Phys. 112, 2650 (2000). 

[76] K. Madsen, H. B. Nielsen, and O. Tingleff, Methods for non-linear least squares problems 

(Informatics and Mathematical Modeling, Technical University of Denmark, 2004), 2nd 

ed. 

[77] G. Breit, Phys. Rev. 35, 569 (1930). 
[78] E. A. Hylleraas, Z. Phys. 48 (1928). 
[79] J. B. Anderson, Phys. Rev. A 35, 3550 (1987). 

[80] W. A. Glauser, W. R. Brown, W. A. Lester, Jr., D. Bressanini, B. L. Hammond, and M. L. 

Kozykowski, J. Chem. Phys. 97, 9200 (1992). 
181] W. M. C. Foulkes, R. Q. Hood, and R. J. Needs, Phys. Rev. B 60, 4558 (1999). 
[82] D. Bressanini, D. M. Ceperley, and P. Reynolds, in Recent Advances in Quantum Monte 

Carlo Methods 2, edited by S. T. W. A. Lester Jr., S. M. Rothstein (World Scientific, 

Singapore, 2002). 

[83] D. Bressanini, G. Morosi, and S. Tarasco, J. Chem. Phys. 123, 204109 (2005). 

[84] C. J. Umrigar, J. Toulouse, C. FiUppi, S. Sorella, and R. G. Hennig, cond-mat/0611094 

(2006). 

[85] M. Casula and S. Sorella, J. Chem. Phys. 119, 6500 (2003). 
[86] M. Casula, C. Attaccalite, and S. Sorella, J. Chem. Phys. 121, 7110 (2004). 
[87] D. M. Ceperley, J. Stat. Phys. 63 (1991). 
[88] L. Mitas, Phys. Rev. Lett. 96, 240402 (2006). 
[89] L. Mitas, cond-mat/0605550 (2006). 
[90] K. Uhlenbeck, Amer. J. Math. 98, 1059 (1976). 
[91] D. Bressanini and P J. Reynolds, Phys. Rev. Lett. 95, 110201 (2005). 
[92] M. Bajdich, L. Mitas, G. Drobny, and L. K. Wagner, Phys. Rev. B 72, 075131 (2005). 
[93] I. Ovcharenko, A. Aspuru-Guzik, and W. A. Lester, Jr., J. Chem. Phys. 114, 7790 (2001). 
[94] A. K. Wilson, T. V. Mourik, and T. H. Dunning, Jr., J. Mol. Struc: THEOCHEM 388, 339 
(1997). 

[95] W. Kutzelnigg, in Explicitly Correlated Wave Functions in Chemistry and Physics, edited 
by J. Rychlewski (Kluwer, Dordrecht, 2003), pp. 44-82, and references therein. 

[96] K. Rektorys, Survey of Applicable Mathematics (Kluwer, Dordrecht, 1994). 

[97] J. Carlson, S.-Y. Chang, V. R. Pandharipande, and K. E. Schmidt, Phys. Rev. Lett. 91, 
050401 (2003). 

[98] J. P Bouchaud and C. L'huillier, Europhys. Lett. 3, 1273 (1987). 

[99] M. Bhattacharjee, Z. Phys. B 82 (1991). 
[100] A. Fabrocini, S. Fantoni, A. Y. lUarionov, and K. E. Schmidt, Phys. Rev. Lett. 95, 192501 

(2005), also in arXiv.org, nucl-th/0607034. 
[101] A. Cayley, Cambridge and Mathematical Journal 7, 40 (1852), reprinted in The collected 



Quantum Monte Carlo 



87 



mathematical papers of Arthur Cayley, Cambridge [Eng.] The University Press, Cam- 
bridge, vol. 2, pp. 19(1889). 

[102] M. Nakahara, Geometry, Topology and Physics (Institute of Physics, Bristol, 2003). 

[103] J. R. Stembridge, Adv. Math. 83 (1990). 

[104] A. Cayley, J. Reine Angew. Math. 38, 93 (1849), reprinted in The collected mathematical 
papers of Arthur Cayley, Cambridge [Eng.] The University Press, Cambridge, vol. 1, pp. 
410-413 (1889). 

[105] F. R. Gantmacher, The Theory of Matrices, vol. 1 (Chelsea, New York, 1959). 

[106] G. Galbiati and F. Maffioli, Discrete Apphed Mathematics 51, 269 (1994). 

[107] M. Bajdich, Ph.D. thesis. North Carohna State University (2007). 

[108] J. R Bouchaud, A. S. George, and C. L'huilHer, J. Phys. France(Paris) 49 (1988). 

[109] J. R Bouchaud and C. L'huillier, Physica C: Superconductivity 153, 1169 (1988). 

[110] J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Phys. Rev. 108, 1175 (1957). 

[Ill] W. J. Stevens, H. Basch, and M. Krauss, J. Chem. Phys. 81, 6026 (1984). 

[1 12] M. Dolg, Chemical Physics Letters 250, 75 (1996). 

[1 13] M. W. Chase, Jr., NIST-JANAF Tables 4, vol. 9 (AIP, 1998). 

[114] Y. B. Randall S. Urdahl and W. M. Jackson, Chemical Physics Letters 178, 425 (1991). 
[115] K. P. Huber and G. Herzberg, Molecular Spectra and Molecular Structure: Constants of 

Diatomic Molecules, vol. 4 (Van Nostrand Reinhold, New York, 1979). 
[116] L. Bytautas and K. Ruedenberg, J. Chem. Phys. 122, 154110 (2005). 
[1 17] I. G. Gurtubay and R. J. Needs, J. Chem. Phys. 127, 124306 (2007). 
[118] M. D. Brown, J. R. Trail, P L. Rios, and R. J. Needs, J. Chem. Phys. 126, 224110 (2007). 
[1 19] Y. Kwon, D. M. Ceperley, and R. M. Martin, Phys. Rev. B 58, 6800 (1998). 
[120] M. Holzmann, D. M. Ceperiey, C. Pierleoni, and K. Esler, Phys. Rev. E 68, 046707 (2003). 
[121] J. Zaanen, G. A. Sawatzky, and J. W. Allen, Phys. Rev Lett. 55, 418 (1985). 
[122] D. Kasinathan, J. Kunes, K. Koepemik, C. V. Diaconu, R. L. Martin, I. Prodan, G. Scuse- 

ria, N. Spaldin, L. Petit, T. C. Schulthess, et al., Phys. Rev. B 74, 195110 (2006). 
[123] J. Kolorenc and L. Mitas, Phys. Rev. Lett. 101, 185502 (2008). 
[124] Z. Fang, I. V. Solovyev, H. Sawada, and K. Terakura, Phys. Rev. B 59, 762 (1999). 
[125] J. Zhang, Phys. Rev. Lett. 84, 507 (2000). 

[126] Z. Fang, K. Terakura, H. Sawada, T. Miyazaki, and I. Solovyev, Phys. Rev. Lett. 81, 1027 

(1998). 

[127] in CRC Handbook of Chemistry and Physics, edited by D. R.Linde (CRC Press, Boca 
Raton, FL, 2007). 

[128] C. A. McCammon and L. Liu, Phys. Chem. Minerals 10, 106 (1984). 

[129] H. K. Bowen, D. Adler, and B. H. Auker, J. Solid State Chem. 12, 355 (1975). 

[130] G. Ortiz, D. M. Ceperley, and R. M. Martin, Phys. Rev. Lett. 71, 2777 (1993). 

[131] P Vagner, M. Mosko, R. Nemeth, L. K. Wagner, and L. Mitas, Physica E 32, 350 (2005). 



